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1. Introduction 

In [B] and other related papers, Srivastava et al introduced a new method for 
analyzing the shapes of absolutely continuous functions [0,1] —» M. N . By “shape” 
we mean that under this analysis, two such functions are considered equivalent if 
they only differ by a reparametrization, i.e., by composition with a diffeomorphism 
[0,1] -A [0,1]. The method is based on producing a bijection between the set of 
absolutely continuous functions (starting at the origin) and L 2 (J, R N ), and then 
transferring the L 2 metric back to the set of absolutely continuous functions. The 
L 2 function corresponding to a given absolutely continuous function is called its 
square root velocity function (SRVF), and the general method is referred to by the 
same name. The result is a complete metric on the space of absolutely continuous 
functions starting at the origin. Furthermore, with respect to this metric, the group 
of diffeomorphisms acts by isometries. This makes it possible to mod out L 2 (I,M. N ) 
by an appropriate group of reparametrizations, resulting in a quotient space that 
is also a complete metric space. 

The SRVF metric and the corresponding quotient construction have proved quite 
useful for analyzing shapes of functions and curves for several reasons: 

• The metric has a compelling geometric interpretation as an elastic metric 
(see i), under which optimal deformations minimize a combination of 
bending and stretching. 

• It provides a very effective solution to the classical problem of aligning two 
functions R —► R by warping their domains. (See Tucker et al [5].) 

• It can easily be adapted to a method of comparing closed curves, which 
comprise a complete subspace of the metric space of all curves. These 
closed curves are especially important because they occur as outlines of 
images. (See [6].) 

• With some modifications, it can effectively be adapted to the analysis of 
curves up to affine transformation. (See Bryner et al [2].) 

A fundamental problem that arises in the implementation of this method is the 
“optimal matching” problem: Given two functions / —> R w , determine reparametriza¬ 
tions of these functions that achieve the infimum of the distance between the two 
corresponding orbits under the reparametrization group. Finding such an opti¬ 
mal matching is important not only because it results in a precise computation 
of the distance between two orbits, but also because it allows one to find shortest 
geodesics in the quotient space. In theory, we do not know whether such a pair 
of optimal reparametrizations always exists! In most previous implementations, a 


l 


2 


SAYANI LAHIRI, DANIEL ROBINSON, AND ERIC KLASSEN 


solution to this optimal matching problem has been approximated using a dynamic 
programming algorithm (once again, see 0). 

The current paper has two primary goals: (1) to establish the theoretical under¬ 
pinnings of the SRVF method, especially the delicate quotient construction referred 
to above and (2) to exhibit an algorithm that provides a precise solution to the 
optimal matching problem for continuous piecewise linear functions. The set of 
piecewise linear functions is very useful because it is the simplest way of interpo¬ 
lating functions for which we have only a finite set of data points, and because it 
is dense in the space of absolutely continuous functions with respect to the SRVF 
metric. 

There is considerable literature on similar methods for analyzing curves. For 
example in Younes et al EH a representation of planar curves is used that is similar 
to SRVF, but involves the complex square root of the velocity (as opposed to the 
SRVF method, which only takes the square root of the magnitude of the velocity). 

This method results in a beautiful way of handling closed curves, but does not 
generalize easily to curves in R w . Also, in EH, only smooth curves are considered, 
which means that the resulting quotient space is not a complete metric space. 
Sundaramoorthi et al consider a similar metric on the space of smooth planar 
curves in [8]. 

In Bauer et al [I], a whole family of metrics on planar curves is considered, which 
includes both the SRVF metric and the metric in EH as special cases. However, 
this paper also does not generalize to curves in 1 N . 

In Daniel Robinson’s unpublished doctoral dissertation [2], a precise matching 
algorithm is introduced for PL functions I —> R, but it does not easily generalize 
to PL functions I —» R w . Some of the theoretical material from Section 2 is also 
adapted from 0. 

The main advances in the current paper are as follows: 

• A rigorous development of the SRVF metric. 

• A careful development of the quotient of L 2 (/, R^) by the group of reparametriza- 
tions; this includes a characterization of the closed orbits involved in the 
construction. 

• A description and inplementation of an algorithm that gives a precise solu¬ 
tion to the matching problem for PL curves, a class of curves that is dense 
in the space of all absolutely continuous curves. 

One issue that this paper does not address, is the action of the group of rotations, 
0(N,R), on the space of absolutely continuous curves. This part of the theory is 
easier because it involves a linear action by a compact finite dimensional Lie group, 
and there are straightforward analytic methods for handling it (see Srivastava et al 
[6] for details on how to do this). 

The contents of this paper are as follows: In Section 2, we define the square 
root velocity function (SRVF) of an absolutely continuous function / —► R w , and 
we define the group T of reparametrizations; we also define a semigroup T that 
contains T. In Sections 3 and 4 we prove that the closure of each orbit under T can 
be expressed as an orbit under T. This is important since, if we wish our quotient 
space to inherit a metric, the orbits must be closed sets. (In Section 3, this theorem 
is proved for functions / —»• R, while in Section 4 it is generalized to functions 
I -A M. N .) In Section 5, we begin to focus on piecewise linear functions, which 
comprise a dense subset of the set of all absolutely continuous functions with respect 
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to the SRVF metric. In particular, we prove that if we are given two orbits [f/r ] 
and [q 2 ] under the action of T, and if at least one of these orbits contains the SRVF 
of a piecewise linear function, then there exist orbit representatives that realize 
the minimum distance between these orbits. (Such a pair of orbit representatives 
is called an optimal matching of the two orbits.) We also prove that if both of 
these orbits contain the SRVFs of PL functions, then this optimal pair of orbit 
representatives can be chosen to be the SRVFs of PL functions. In Section 6 , we 
begin our discussion of how to produce an optimal matching between PL functions, 
setting up some basic terminology. In Section 7, we prove a theorem establishing 
certain properties that an optimal matching between PL functions must have. In 
Section 8, we give a precise algorithm for producing an optimal matching between 
two PL functions, based on the theorem proved in Section 7. Section 9 gives a few 
examples of optimal matchings produced by the algorithm described in Section 8. 

We thank our colleague Dan Oberlin for several helpful conversations. 

2. Basic Quotient Construction for Curves in R n 

In this paper, we consider absolutely continuous functions I —> K w , where I = 
[0,1]. A function / : [a, b] —» R. is absolutely continuous if and only if it has 
a derivative f almost everywhere, /' is Lebesgue integrable, and for all t £ /, 
fit) = /( 0 ) + Jo f'i u) du. (This is not the usual definition of absolute continuity, 
but it is well known to be equivalent to the usual definition; see, for example, 
Theorems 11, p. 125 and Theorem 14, p. 126 of [5].) Let ACq(I,R n ) denote the 
set of absolutely continuous functions I -A with the property that /(0) = 0. We 
want to compare these functions up to reparameterization. In other words, given 
/ and g in ACq(I,M . n ), we want to consider them as equivalent if there exists a 
“nice” homeomorphism 7 : / -A / such that /07 = g. If they are not equivalent, we 
would like a quantitative measure of how far from being equivalent they are. Let T 
denote the group of functions 7 : I —> I which have the following three properties: 

(1) 7 is absolutely continuous, (2) 7 ( 0 ) = 0 and 7 ( 1 ) = 1, and (3) 7 '(t) > 0 almost 
everywhere. T is a group under composition. Clearly, T acts on ACq(I, R w ) from 
the right by composition. We would like to make the quotient set ACo(I, K JV )/r 
into a metric space in a reasonable way. There are two important issues to overcome 
here. The first is that to get a reasonable metric on a quotient space, it helps if the 
group acts by isometries. The second is that the orbits should be closed sets. We 
tackle these one at a time. 

Before we turn to these two issues, it will be helpful to define a semigroup 
containing T. Let T be the set of functions 7 :/—>■/ satisfying (1) 7 is absolutely 
continuous, (2) 7 ( 0 ) = 0 and 7 ( 1 ) = 1, and (3) 7 '(f) > 0 almost everywhere. Note 
that r is a semigroup, and also acts on ACq(I 7 R n ) from the right by composition. 

We now describe a way to understand the action of T (and, therefore, T) as an 
action by isometries. To do this, begin by defining a function V : R N -A R^ by 

f -V= for x / 0 
Vfx) = < vi*l 

[0 for x = 0 

Denote by L 2 (/, R N ) the space of square integrable functions I —> R N , with 
standard L 2 inner product denoted by ( 91 , 52 ) and distance function defined by 
d(qi,q 2 ) = \J (91 — q 2 , q\ — 92 )- Define a function Q : AC 0 (I,R N ) -A L 2 (I,R N ) by 
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Q(f) = V o f. It’s easy to see that Q is bijective (this is proved in [ 4 ]); in fact 
given a function q £ L 2 (I), we can define f(t) = f* q(u)\q(u)\ du , and then verify 
that Q(f) = q. 

Since f acts on ACq(I,R n ) and Q : ACq(I, R n ) —t L 2 (I,R N ) is bijective, we 
can define an action of F on L 2 (I , R w ) in a unique way to make Q equivariant. In 
fact, it is easy to verify that the corresponding right action of T on L 2 (I,R N ) is 
given by (q * 7)(t) = q(7(f))v / 7 / (t). Furthermore, this action of F on L 2 (I,R N ) is 
by isometries since 

( 91 * 7 , 92 * 7 )= f 9 i {l(t) ) \! Y(t)q 2 (7 (t) ) V 7'(0 dt 
Jo 

= <n(l(t))q 2 ('y(t))y(t)dt = / qi{u)q 2 {u) du = (91,92)• 

Jo Jo 

Note that for the second to last equality, we relied on integration by substitution, 
which is valid because 7 is absolutely continuous. This is one important reason for 
insisting that our reparameterization functions are absolutely continuous. Thus, we 
replace our study of the action of T on ACq(I, R w ) by the study of the corresponding 
action of T on L 2 (/, R^), which is an action by linear isometries. In what follows 
we will be interested both in the action of F, and in the restricted action of T. Note 
to the reader: Our definition of “action by isometries” is simply that for all 7 £ T 
and for all 91,92 £ L 2 (I , R w ), (qi * 7, 92 * 7) = (91,92)- While this equation implies 
that the map L 2 (I 7 R N ) —> L 2 (I,R N ) induced by each 7 is injective, it does not 
imply that it is surjective. For example, suppose that 7 £ F is constant on some 
subinterval of I. Then for all q £ L 2 (I,M. N ), q * 7 = 0 on this same subinterval. Of 
course, for 7 £ F, the induced map is bijective, since T is a group. 

Denote by U(I,M. N ) the unit sphere {q £ L 2 (J,K JV ) : J\q{t)\ 2 dt = 1}. This 
corresponds to the set of functions in ACo(I, i^) having arc length 1, since if 
Q(f ) = q, it follows that |/'(t)| = \q{t)\ 2 , and the arclength of / can be written as 
fo |/'(i)| dt. U(I,R N ) is an invariant subset of L 2 (I,R N ) under the action of f, as 
is the sphere of any radius centered at 0 in L 2 (I,M. N ). If we wish to compare two 
curves in a way that is invariant to rescaling, a natural way to do this is to rescale 
both of them to have unit length before comparing them. Hence, we sometimes 
concentrate on the action of F on (7(/,I ff ). 

U(I,R N ) is an infinite dimensional submanifold of L 2 (J, R^). If we think of it 
as a Riemannian manifold, using the L 2 -inner product as a Riemannian metric, 
then the geodesics are the arcs of great circles, where by “great circle” we mean 
the intersection of U(I,R N ) with any 2 -dimensional linear subspace of L 2 (J, R^). 
The corresponding (geodesic) distance function between any q\ and q 2 in U(I,R N ) 
is given by cos _1 ((gi, q 2 )). Note that T and T act on U(I, I N ) by isometries. 

We now return our attention to the action of T on L 2 (I,R N ). Given q £ 
L 2 (I,R n ), let qF denote the orbit of q under F, and let L 2 (I,R N )/F denote the 
set of all these orbits. Define a function p : (L 2 (I,R N )/F) x (L 2 (J,K^j/F) —> R 
by p(9il\ 9 2 F) = inf (7l)72)erxr d(gi * 71, q 2 * 72) = inf 76r d(qi, q 2 * 7). The last 
equality follows from the fact that T acts by isometries. As usual, it’s easy to show 
that p is symmetric, satisfies the triangle inequality and is non-negative. However, 
it’s also easy to find examples where giT 7^ q 2 F, but p(qiF, q 2 F) = 0 . The reason 
for this is that the orbits are not closed sets, so all you have to do is choose q 2 to 
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be in the L 2 -closure of giT, but not in the orbit itself, in order to create such an 
example. 

For example, define 7 £ T by 


7(f) = 



for t < .5 
for t > .5 


While 7 ^ T, we now construct a sequence {7„} in T such that { y/Yn) approaches 
y/Y in the L 2 sense. To do this, let 


7 n{t) 


(2 — i)t for 0 < f < .5 
(l-i) + if for .5 < t < 1 


Now, let <7o(f) = c denote a constant function, where c £ is any nonzero 
vector. For each 7 £ T, (g 0 *7)(f) = \/l '(f)c. Then VT^c ^ g 0 r, but VT^c is a limit 
point of goT because each \jYn c is i n the orbit goT, and clearly \[Yn c 'JY C with 
respect to the L 2 norm. Hence go * 7 ^ goT, but go * 7 is in the closure of goT. 


Lemma 1. Assume q\ and g2 are elements of L 2 (I,M. N ). Then p(qiY,q2T) = 0 if 
and only if qiY and g2F have the same closure in L 2 (I : M. N ). 


Proof. Suppose p(gir,g2r) = 0 . Then there exist sequences {7„} and {7n} in T 
such that linin^oo d(qi * 7„,g2 * 7n) = 0 . Because T acts by isometries, it follows 
that limn^oo d(q±,q2 * TnTn 1 ) = 0 , proving gi is in the closure of g2F. Since there 
was nothing special about the orbit representatives we chose, and the argument is 
symmetric, if follows that each orbit is in the closure of the other; hence, giT and 
g2T have the same closure. The other direction is obvious and we omit it. □ 


Because of this lemma, the closure of an orbit is a union of orbits; if we define a bi¬ 
nary relation on L 2 (/, by stipulating that qi ~ g 2 means that giF and g 2 T have 
the same closure, then ~ is an equivalence relation. Let S(I,M. N ) = L 2 (/,R JV )/ 
Henceforth, we will use the symbol [g] to denote the point in 6>(J, R^) correspond¬ 
ing to the closure of the orbit gT. We will loosely refer to this as the “orbit” of g, 
even though it is actually a closed-up orbit. It is easily verified that our distance 
function d on L 2 (I : M. N ) induces a metric, which we also call d, on S(I, R^), defined 

by 

d([qi], [g 2 ]) = inf r d(w 1 ,w 2 ) 

Wie[qx],w 2 e[q^\ 

and this metric induces the quotient topology. 

We now prove a theorem giving a general form for elements of L 2 (J, R^). 

Theorem 1. Let q £ L 2 (I,M. N ). Then q can be written in a unique way as q = w*j 
where 7 £ F and w £ L 2 (J,R w ) has the property that |u>(f)| is constant a.e. for 
t £ I. 


Proof. By Theorem 4.1 of [ 7 ], every absolutely continuous function on a closed in¬ 
terval is rectifiable, and by Theorem 4.3 of the same book, every rectifiable function 
has a constant speed parametrization, with the reparametrizing function being ab¬ 
solutely continuous. Since for every q £ L 2 (I , R w ), there is an absolutely continuous 
function / such that q = Q(f ), our lemma follows immediately. □ 

We now wish to focus on a particular subset of S(I,R N ). We define a function 
q : I —> M. n to be a step function if we can express I as a finite disjoint union of 



6 


SAYANI LAHIRI, DANIEL ROBINSON, AND ERIC KLASSEN 


subintervals on each of which g is constant. Since we will only use this concept 
for L 2 functions, we don’t care what happens at the endpoints of each subinterval. 
It is a well-known fact (Prop. 10 , p .151 of [ 5 j) that the step functions are dense 
in L 2 (J, R^). We define S(I,’M. N ) st C S(I, R w ) to be the set of all equivalence 
classes that contain at least one step function. We see immediately that S(I , M JV ) s t 
is dense in S(I, R^). Furthermore, under our bijection Q between AC o and L 2 , 
the step functions in L 2 correspond to the piecewise linear functions (with finitely 
many pieces) in ACq. 

Later in this paper, we will prove that given two elements [<7] and [w] of S(I, R^), 
if at least one of them is in S(I , M. N ) st , then there exist elements q £ [<7] and w £ [w] 
such that d(q,w) = rf([<7], [u>]). We will also prove that if both [<7] and [w] are in 
S(I,R N )st, then q and w can both be chosen to be step functions. As a result, it 
will follow that S(I,M. N ) st is a geodesically convex subset of S(I,M. N ), in the sense 
that given [<7] and [w] in there exists a minimal geodesic in <S( 7 , R w ) 

joining [<7] to [u>] that lies entirely in 5 (/,K Ar ) st . 

3. Orbit Structure in S(/,R) 

In this section, we will specialize to the case N = 1, so all of our functions are 
scalar valued instead of vector valued. In this case, T C ACo(/,R), and Q(r) C 
t/(/,R). In fact, Q(r) = {q £ U(I, R) : q(t) > 0 a.e.}. To see this, we refer to 
Exercise 3.21 on page 82 of [3], which states that a continuous, strictly increasing 
function u : [a, b] —> R has an absolutely continuous inverse if and only if the set on 
which its derivative vanishes has measure 0. It is then immediate that the closure 
of Q(r) in {/(/,R) is {q £ U(I, R) : q(t) > 0 a.e.}. Note that this last set is equal 
to Q(f). Hence, the closure of <3(r) in [/(/, R) is Q(f). 

We begin by examining the simplest orbit in U (I, R). Let qo = 1 denote the 
constant function. What can we say about the orbit [170]? 

Lemma 2. [g 0 ] = C7(<2(r)) = {g £ XJ{I, R) : for all t £ I,g{t) > 0 a.e.} = Q(T). 

Proof. Given 7 G F, it’s immediate that (qo*j){t) = y/'y'ft) = Q( 7)(t). The lemma 
then follows immediately from the previous paragraph. □ 

Corollary 1. Given any gi £ U(I, R) and 772 G 1/(1,R) satisfying gift) >0 a.e. 
and 772(0 > 0 a.e., there is a sequence {7™} in F such that g± * 7„ —>■ 772 in the L 2 
metric. 

Proof. Since 771 and g 2 are both elements of Cl(< 5 (r)) = [q> 0 ], it follows that [771] = 
[772] = [go]- The corollary follows immediately. □ 

Let us think of go = 1 as the “north pole” of U(I, R), and —go = —1 as the 
“south pole”. Then the “upper hemisphere” is {77 G [/(/,R) : f* g(t)dt > 0 }, the 
“lower hemisphere” is {77 G U(I, R) : J[' g(t)dt < 0 }, and the “equatorial sphere” 
is {g £ U(I , R) : fg g(t) dt = 0}. 

We next note that [go] is completely contained in the upper hemisphere, and in 
fact does not intersect the equatorial sphere. That’s because if f 0 g(t) 2 dt = 1 but 
f ' g(t) dt < 0, then it is clear that there must be a set of measure greater than zero 
on which 77(f) < 0. 
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We now observe that even though [g 0 ] does not intersect the equatorial sphere, 
it does contain points arbitrarily close to the equatorial sphere! To see this, let 
0 < e < 1, and define g e £U(I, R) by 


9e(t) 


1/yfi for t < e 
0 for t > e 


and define v e £ U (/, R) by 


v e (t.) 



for t < e 
for t > e. 


Clearly, g e £ [go], v e is in the equatorial sphere, and d(g e ,v e ) = \/2 — 2/1 — e, 
which can be made as small as desired by taking e small. 

The next lemma will calculate the distance between two specific orbits, and will 
provide actual orbit representatives that realize this distance. 


Remark 1. Since T and f act by isometries on L 2 (I,M. N ), it follows that min¬ 
imizing the distance between orbit representatives of [g] and [to] is equivalent to 
maximizing their L 2 inner products. 


Lemma 3. Suppose the unit interval is expressed as a disjoint union of measurable 
sets, I = A U B, where A is assumed to have measure a. (For purposes of visual¬ 
ization, the reader may want to keep in mind the case in which A and B are simply 
subintervals of I.) Define 


w(t) 


1 for t £ A 
— 1 for t, £ B. 


Then d([go], [w]) = yj 2 — 2/a, and this distance is realized by the orbit representa¬ 
tives w £ [u>] and q A € [go], where we define 


QA(t) 


1 / y/a for t £ A 
0 for t £ B. 


Proof. First, it’s easily verified that fg w(t)q(t) dt = s/a (implying d(q, w) = y/2 — 2 y/a). 
So we just need to prove that this is the maximum over all representatives of [go]. In 
what follows, the key step will be the Cauchy Schwarz inequality, which states that 
for arbitrary / and g in L 2 (A), \ f A f(t)g(t) dt\ < (f A f(t) 2 dt) 1/2 (f A g(t) 2 dt) 1/2 . 
Continuing with the proof, choose an arbitrary q £ [g 0 ]. Then calculate 


q(t)w(t)dt= / q(t)w(t) dt + / q(t)w(t)dt. 


Clearly J B q(t)w(t) dt < 0 since q(t) > 0 and w{t) < 0 on B. By Cauchy Schwarz, 
f A q(t)w(t ) dt < 1 • s/a. These two bounds imply that f* q(t)w(t) dt < y/a. □ 


We remark that since geodesics in L 2 (I, ]$. N ) are straight lies, it is easy to write 
down a specific shortest geodesic from q A to w in the lemma above. It’s also easy 
to verify that the image of this geodesic in S(I, R) is a geodesic (in fact, a shortest 
geodesic) in this quotient space, according to the usual definition of geodesic used 
in metric spaces: 
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Definition: A geodesic in 5(7, R) is a continuous function a : [0, L\ —> 5(7, R) 
with the property that there exists a positive number K such that for all s £ [0, L\, 
there exists e > 0 such that for all ti,t 2 £ (s — e, s + e) D [0, L], d(a(ti), a(t 2 )) = 
I\\t 2 — ti|. 

Some fundamental problems regarding geodesics in 5(7, R) are: (1) Given any 
two points in 5(7, R) is there a geodesic joining them? Is there a shortest geodesic? 
Can we find this geodesic in some reasonable way, for some reasonable set of points 
in 5(7, R)? In Lemma [3| we found a precise description of the shortest geodesic 
between the particular orbit [90] £ 5(7, R), and any orbit of the form [u>], where w 
has the form 


w{t) 


1 for t £ A 
— 1 for t £ B. 


However, in Theorem [lj we proved that every element of 7/(7, R) can be uniquely 
expressed as w * 7, where w is of the form described above and 7 £ T. (In fact, we 
proved a more general version of this for U(I,M. N )). Hence, we now know that for 
every orbit [g] £ 7/(7, R), there is a unique shortest geodesic in 5(7, R) joining [<7] 
to the particular orbit [go], and in fact we have given a precise description of that 
geodesic. 


Lemma 4. Let 0 < a < b < 1 and suppose q and w are two elements of U (7, R) 
with the following three properties: 

(1) For all t £ (a, b), q(t) > 0 and w(t ) > 0. 

(2) f^q(t) 2 dt = J^w(t) 2 dt 

(3) For all t fL (a,b), q(t ) = w(t). 

Then [g] = [w]. 


Remark 2. We may replace condition (1) by the assumption that for all t £ (a, b), 
q(t) < 0 and w{t) < 0, and the Lemma still holds, with the same proof. 

Proof. By Corollary [TJ there is a sequence {A„} of absolutely continuous homeo- 
morphisms [a, b] -A [a, 6] with the property that A“ 1 (0) has measure 0, such that 
in L 2 ([a, 6]), q * \ n —t w. (Note that in the proof of Corollary [l] it makes no dif¬ 
ference that we have changed the interval from 7 to [a, 6] and changed the value of 
fbq(t) 2 dt = J^w(t) 2 dt from 1 to whatever it is.) Then for each n, extend X n to 
7„ £ T, by extending it as the identity outside [a, b}. It is then clear that in L 2 (I), 
q * 7n —/ w. □ 

Lemma 5. Let q £ L 2 { 7, R). Then [q] £ 5(7, R) st if and only if there is a finite 
sequence 0 = to < ti < • ■ • < t n = 1 such that for each j, either q(t) > 0 for all 
t £ [tj-i,tj] (a.e.), or q(t) < 0 for all t £ [tj-i,tj] (a.e.). 


Proof. First, suppose there exists a finite sequence 0 = to < ti < • • ■ < t n = 1 such 
that for each j, either q(t) > 0 for all t £ or g(i) < 0 for all t £ 

Then, by successive applications of Lemma [4] we can find a new function in [g] that 
is constant on each [tj-i,tj\. It follows that [g] £ 5(7, R) st . 

On the other hand, suppose [g] £ 5(7, R) st . Let w be a step function in [g]. 
It follows that there exists a sequence {7^} in T such that w * 7* —»• g in 7/(7, R) 
with respect to the L 2 metric. Because of this convergence in L 2 , we may choose 
a subsequence of {7;} such that 1/1*7* —> q a.e. Since w is a step function, we 
can find a sequence 0 = to < ti < • • ■ < t n = 1 such that w is constant on each 
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interval (tj-i,tj). For each j £ {1,..., n — 1} and * = 1, 2, 3,, let tj t i = 7j _1 (^')- 
By compactness of I, we can replace the sequence {7^} by a subsequence with the 
property that for each fixed jo, the sequence converges to a number tj 0 £ /, 

as i -> 00. Now, fix j £ {l,...,n}. We will show that either q(t) > 0 a.e. or 
q(t) < 0 a.e. for t £ WLG, assume that w(t) > 0 on (tj-\,tj). Let 

t £ By definition of the action of T, we know that w * 7 i(t) > 0 for all 

t £ Choose an e > 0 such that there exists an N such that for all 

i > N, (t — e,t + e) C It follows that for all i > N, w * 7*(t) > 0 on 

(t — e,t + e). Since w * 7^ —> q a.e., it follows that q(t) > 0 almost everywhere in 
(tj -1 — e, tj + e). Thus, we have shown that every t £ (tj-i,tj) has a neighborhood 
on which q(t) > 0 a.e. Since a countable number of these neighborhoods cover 
it follows that q{t) > 0 a.e. in (tj-i,tj). Thus, we have produced a finite 
sequence {tj} such that on each either q(t) > 0 a.e. or q(t) < 0 a.e. □ 

Lemma 6. Let 91,92 £ L 2 (/,K), and assume that [91], [92] £ S(I,M.) st . Then there 
exist step functions w\ £ [91] and W2 £ [92] such that d(wi,u> 2 ) < ^(91,92). 

Proof. It suffices to find step functions w 1 £ [91] and u >2 £ [92] such that (w±,W 2 ) > 

(91192)- By applying Lemma [ 5 ] to 91 and q 2 and then taking the union of our finite 
fi-sequences, we obtain a single finite sequence 0 = to < ■ ■ • < t n = 1 such that 
for each i = 1 ,n, both 91 and q 2 have constant sign on .t,]. By “constant 
sign”, we mean that on this interval either 91 (t) > 0 a.e. or 91(f) < 0 a.e., and 
either 92(f) > 0 a.e. or 92(f) < 0 a.e. We now alter 91 and 92 on each of these 
subintervals in the following way. 

( 1 ) If 91 and q 2 have the same sign on [fj_i,tj], then on this interval simply 
replace 91 by the constant function which has the same sign and square 
integral as 91. Do the same for q 2 . 

( 2 ) If 91 and q 2 have different signs on [fj_ 1, tj], then replace 91 on this interval 

by a function that is constant on the first half of the interval, zero on the 
second half of the interval, and has the same sign and square integral on 
[tj_i,tj] as 91 does. Replace q 2 on this interval by a function that is zero 
on the first half of the interval, constant on the second half, and has the 
same sign and square integral as q 2 has on [f»_1, f»]. 

Call the resulting functions w\ and w 2. By performing these replacements 
one subinterval at a time, we see by Lemma [ 4 ] that [roi] = [91] and [w^] = [92]• 
Furthermore, by the Cauchy Schwarz inequality, we see that on each subinterval 
[fj_i,fj] where we performed alteration (1), f t ' i w\(t)w2(t)dt > J t ' qi(t)q2(t)dt. 

For each interval where we performed alteration ( 2 ), the same inequality holds, 
since J*' wi(t)w 2 (t)dt = 0 , while /*' qi{t)q 2 (t)dt < 0 . This completes the proof 
of the lemma. □ 

Lemma 7. Suppose for each n, f n : I —> R is an L 2 function, and f : I —> R is also 
L 2 . Suppose that f n —» / in the L 2 norm. Assume that for each n, we are given 
an a n in I such that f Q "(/ n ) 2 = 1 for all n. Furthermore, suppose that a n —» a in 
I. Then /“ / 2 = 1. 

Proof. By passing to a subsequence, we can assume that f n —> f a.e. in I. Next, 
recall a variant of the dominated convergence theorem: if |fe„| < h n , k n —» k a.e., 
h n h a.e., and J 0 K -f Q h < 00, then f Q k n -> f g k. Since |||/ n || 2 - H/H2I < 
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II fn ~ fh, it follows that Jo f 2 f* f 2 . Taking k n = \f 2 - f 2 \, k = 0 , h n = 
fn + P, and h = 2 f 2 , it follows that \f 2 - f 2 \ ->• 0. 

We now compute 







f- 


The first of these terms was proved to approach 0 at the end of the last paragraph; 
the second approaches 0 by the absolute continuity of the integral, since a n —> a. □ 


Lemma 8. Suppose q and w are elements of L 2 (I,M.), and assume that 0 = to < 
ti < • • • < t n = 1 is a sequence such that for each i both of the following statements 
are true: 

• either q(t) > 0 for t £ [tj_i,tj] a.e. or q(t) < 0 for t £ [tj_i,tj] a.e. and 

• Iti, q(t) 2 dt > °. 

Then w £ [g] if and only if there exists a sequence 0 = to < ti < • • • < t n = 1 such 
that for each i, both of the following statements are true: 

• either q(t) > 0 for t £ [t»_i, t»] a.e. and w(t) > 0 for t £ [tj_i,tj] a.e. or 
q(t) < 0 for t £ [tj_i,tj] a.e. and w(t) < 0 for t £ [tj_i,tj] a.e. and 

• fiit w(t) 2 dt = q(t) 2 dt. 

Proof. We omit the proof; it is basically the same as the proof of Lemma [5j but 
uses Lemma [7] to keep track of the square integrals. □ 

Lemma 9. Let q £ L 2 (I, R). Then qT C [g]. 

Proof. We will prove this first for step functions, and then extend by density to 
all of S , so we start by assuming that q is a step function, and let 7 £ T. Let 
0 = to < ti < • • • < t n = 1 be the finite set of points at which q(t) changes values. 
For each i, choose ti £ I such that 7 (ti) = L. Letting w(t) = g*7(f), it follows from 
integration by substitution that the hypotheses of Lemma [8] are satisfied. Hence, 
w £ [g], which completes the proof for q a step function. 

Now, let g £ L 2 (I, R) be arbitrary, and let 7 £ T. Let e > 0 be given. By 
density, choose a step function v £U(I, R) such that d{q, v) < e/3. By the previous 
paragraph, we know that v * 7 £ [u]; this means we can choose 7 £ T such that 
d(v* 7, u*7) < e/3. By the triangle inequality, and the fact the T acts by isometries, 
we then conclude that 


d(q * 7, q * 7) < d(q * 7, v * 7) + d(v * 7, v * 7) + d{v * 7, q * 7) < e, 
which completes the proof of the current lemma. □ 


Define a function w £ U(I , R) to be in standard form if it is of the form described 
in Lemma [ 3 j i.e., 


w(t) 


1 for t £ A 

— 1 for t £ B 


where I = A U B is a partition of I into two disjoint measurable sets. 


Lemma 10. Suppose q and w are both in standard form, and q w in L 2 (i.e., 
the set {t £ I : q(t) 7^ w(t)} has measure greater than 0 /. Then w £ [g]. 
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Proof. First we make a simple calculation. Let 0 < a < U, and define p : [0, U\ — > K. 
by p{t) = y/ta + y/(U — t)(U — a). Then p(t) has a unique maximum at p(a ) = [/; 
in fact p'(t) > 0 for t < a and p'{t) < 0 for t > a. This is an easy Calc I exercise! 

Since q and w are both in standard form, we have two partitions I = A U B and 
I = C U D such that 


q(t) 


1 for t € A 
-1 for t. G B 


and 


w(t) 


1 for t G C 
-1 for t G D. 


Let ji denote Lebesgue measure. The remainder of the proof will consist of consid¬ 
ering the two cases p(A) yf g{C) and p(A) = p(C). 

Case 1: Assume p(A) p{C). Suppose 7 G T, and let A = 7 ~ 1 {A) and 
B = 7Using integration by substitution, we see that f^(q * 7 (t)) 2 dt = 
J A {q(t)) 2 dt = p(A) and f B (q* 7 (t)) 2 dt = J B (q(t)) 2 dt = p(B). Now compute: 


q*^{t)w[t)dt = 


q*"f(t)w(t)dt+ / q*j(t)w(t)dt+ / q* / y(t)w(t)dt+ 


q*"f(t)w(t)dt 


>AnC 


>AnD 


/Bnc 


I BHD 


< 


q * 7 (t)w(t)dt + f 


q * 7 [t)w(t)dt 


JAnC JBnD 

since on An I? and BnC, q* 7 and w have opposite signs, so these two terms make 
a negative contribution to the integral. However, by the Cauchy Scharz inequality, 


q * j(t)w(t)dt < 


{q*i{t)) 2 dt x 


( w(t)) 2 dt 


/Anc 


/Anc 


/Anc 


< 


{q * l{t)) 2 dt\ / (■ w(t)) 2 dt = sJp{A)p{C). 


Similarly, 


/ Br\D 


q * j(t)w(t)dt < y/p(B)p(D). 


Combining these with the last inequality gives 

[ q*^{t)w{t)dt < sjg{A)g{C)+yJ p{B)p{D) = sJp{A)p{C)+s/{l - p(A))( 1 - p(C)) 


Since we are assuming here that p(A) yf /z(C'), it follows from the calculation we 
made at the beginning of this proof that this upper bound is strictly less than 1. 
Also, this upper bound is independent of which element 7 G T we chose. Since 
(q * 7, w) has an upper bound that is strictly less than 1 on the orbit qT, it follows 
that d(q * 7,1c) has a lower bound that is greater than zero on this orbit. This 
finishes the proof in Case 1 ; we have shown that if p(A) yf p{C), then w ^ [</]. 

Case 2: Assume p(A) = p(C) = U (so p{B) = g/{D) = 1 — U). 

Let /, g G A Co be the absolutely continuous functions satisfying Q(f) = q and 
Q(g) = w. Since Q is a bijection, / y^ g, so there exists 2 G I such that f(z) y^ g{z). 
By definition of Q , f(z) = p{A n [ 0 , z\) — g(B n [ 0 , z\) and g(z) = g{C D [ 0 , 2]) — 
p(D fl [ 0 , 2]). Since 2 = /x(A D [ 0 ,2]) + p{B (~l [ 0 , 2]) = p(C D [ 0 ,2]) + p(D (~l [ 0 , 2]), 
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we may conclude that p{A fl [0, z]) ^ pi(C fl [0, z]); without loss of generality, let’s 
assume p,(A fl [0, z]) < p,(C fl [0, z]) and hence p,(B D [0, z]) > n(D fl [0, z]). 

Let 7 € r be arbitrary. 

Compute: 

(q* 7, w;) = / q * "f(t)w(t)dt = / q * j(t)w(t)dt + / q* "f{t)w(t)dt 
Jo Jo J z 


< 


L 


[o,z]n(AnC) 


q * 7 (t)w(t)dt 


'[o,z]n(snD) 


q * 7 (t)w(t)dt 


+ 


q * 7 (t)w(t)dt 


q * 7 (t)w(t)dt + / 

^[z,i]n(AnC) i[z,i]n(BnD) 

where this inequality follows because on the parts of the interval we left out, the 
contribution of the integrand is negative. Continuing by the Cauchy Schwarz in¬ 
equality: 


< 


'[o,z]n(Anc) 


(q*"/(t)) 2 dt x 


qo,z]n(AnC) 


w(t) 2 dt+* 


qo,z]n(snD) 


(g*7(f)) 2 dC 


qo,z]n(snD) 


j(t) 2 dt 


< 


I (<7*7 {t)) 2 dt x 

[z,i]n(AnC) 1 

[ {q*i{t)) 2 dtj [ 

/[o,z]nA 1 


'[o,z]nC 


[z,i]n(AnC) 
w(t) 2 dt+. 


w{t) 2 dt+. 


qo,z]nB) 


[z,i]n(snD) 

(< q*j{t)) 2 dt , 


(q * i{t)) 2 dt\ 


'[o,z]nD 


[z,i]n(snD) 
w[t) 2 dt 


){t) 2 dt 


'[z,i]nA 


_(q*j(t)) 2 dt. 


'[z,i]nc 


w(t) 2 dt+. 


'[z,i]ns 


(q*'y{t)) 2 dt 


y J[z,i]no 


]{t) 2 dt 


'[0, 7 (z)]nA 


q(t) 2 dt. 


qo,z]nc 


w(t) 2 dt + 


/[o, 7 (z)]nB) 


q(t) 2 dt. 


'[o,z]nD 


]{t) 2 dt 


+ i 


i[j(z),i\nA 


q{t) 2 dt. 


s,i]nc 


w(t) 2 dt 


'[ 7 (z),i]ns 


q(t) 2 dti 


,i]nD 


w(t) 2 dt 


= \/ M[o, 7(2)] n A)n([ o, z] n C) + vV( [0,7(2)] n B)p,{[o,z] nD) 

+^([7(2), 1] n A)n{[z, 1] nc) + yj ^([7(2)) 1] n B)p{[z, 1] n d) 

= V m([o, 7(2)] n A)n({ 0, z] n C) + yj p([o, 7(2)] n B)p([ 0, z\ n D) 

+ y/(U - At([0,7(2)] n A))(C/ - /x([0, z] n C))+y/( 1 - 17 - p([0,7(z)] n B))(l - U - m([0, 2] n £>)). 

Reversing the order of the middle two terms gives 

(l) = V Mi 1 0, 7(2)] n A)p([o, zlncj+Vp- M[o, 7(2)] n A))(c - p([o, z] n c)) 

Wm([0,7(z)] n B)M[ 0 , 2] n l>)+\/(i - u - M ([o, 7(2)] n b)){ 1 - - M ([o, z] n £>)). 

Now there are two possibilities to consider: Either 7(z) < z or 7(z) > z. 

Case (i): Assume 7(z) < z. In this case /i([0,7(z)] Cl A) < /x([0, z] fl A) < 

/i([0, z] (1 C). 

By the observations about p(t) and p'(t) made in the first paragraph of this proof, 
we conclude that the sum of the first two summands of expression [l] is bounded 
above by 

VM[o, 2] n A)n([ 0, z] n C) + V(U - M[o, 2] n A)){u - M[o, z] n c)) 
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and that this bound is strictly less than U. Also, by the first paragraph of this 
proof, the sum of the third and fourth summands of expression [l] is bounded above 
by 1 — U. As a result, in Case (i), we have an upper bound for (q * j,w) which 
is strictly less than 1, and is independent of 7 £ F (except for the condition that 
7W < z). 

Case (ii): Assume 7 (z) > z. In that case, ^i([0,7 (z)] n B) > /u([ 0 , c] n B) > 
/i([0,2] n D). Then, by the observations about p(t) and p'(t) made in the first 
paragraph of this proof, we conclude that the sum of the third and fourth summands 
of expression [l] is bounded above by 

Vm([ 0 , z\ n B),i([ 0, z\nD) + y/{i-u- m([ 0 , z]nB)){i-u- /x([o, z] n £>)). 

and that this bound is strictly less than 1 — U. Also, by the first paragraph of this 
proof, the sum of the first two summands of expression [l] is bounded above by U. 
Hence, in Case (ii), we also have an upper bound for ((7*7,11;) which is strictly less 
than 1, and is independent of 7 £ T (except for the condition that 7(z) > z). 

Taking the greater of these two upper bounds, we have an upper bound for 
(q * 7, w) that is strictly less than 1 and is completely independent of the choice of 
7 £ T. As a result, we have a lower bound for d(g*7, w) that is strictly greater than 
0 and is completely independent of 7 £ T. This proves that w fL [<7], and completes 
the proof of Lemma [lOj □ 

Let SF(I,M) = {q £ U(I,M) : q is in standard form.} By the unique arclength 
parametrization theorem quoted earlier (Theorem [Tj) , we can express £/(/,R) as a 
disjoint union as follows: 

t/(/,R) = wf 

w£SF 

By Lemma [9] qT £ [q] for each q £ SF. By Lemma 
then q ^ [w]. It follows that [g] D [w] = 0 and therefore gT n [tu] = 0 . By the 

disjoint union above, since [u>] n gf = 0 for all q £ SF where q ^ w, it follows that 

[u>] C wf. Combined with Lemma [oj this proves the following theorem. 

Theorem 2. For all w £ SF, [w] = itT. 

Corollary 2. If q £ L 2 (J, R), and g —1 ( 0 ) has measure 0 , then [g] = gT. 

Proof. First, assume that g £ U{I, R). By Theorem [lj we can write g = w * 7 = 
(w o 7 )Vt 7 ) where 7 £ f and |u>(f)| = 1 for almost all t £ I. Since g _1 ( 0 ) has 

measure 0 , it follows that {t £ I : 7 '(t) = 0 } has measure 0 , so 7 £ T. Hence, we 

can write w = g*7 -1 and, therefore, [g] = [«;] = wT = g7 _1 T = qT, since 7^ 1 T = F. 
For g ^ the result also follows, since multiplication by a constant nonzero 

scalar is a homeomorphism that commutes with the action of T. 

□ 


10 


if g, w £ SF and g / «), 


4. Orbit Structure in 5(1, R N ) 

In this section, we will extend Theorem [2] and Corollary 1 from L 2 ( I , R) to 
L 2 (/,M Ar ). This has a few more technical difficulties than one might expect; hence 
it gets its own section. Just as in the case of N = 1, we define w £ U(I,R N ) to 
be in standard form if |iu(t)| = 1 for t £ I a.e. Let SF(I,M. N ) be the subset of 
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U(I,M. N ) consisting of functions in standard form. By Theorem [l] we can express 
[/(/jK^) as the disjoint union 

(2) U(I,R n ) = wT. 

w£SF(I, R N ) 

We define q £ L 2 (J, R w ) to be a step function if we can express / as a finite union 
of disjoint sub-intervals in such a way that q is constant on each of these subintervals. 
Again, we don’t care what happens at the endpoints of the subintervals since q is 
only well-defined almost everywhere. 

Lemma 11. For all q £ L 2 (I, R^), qt C [g]. 

Proof. The proof follows the same lines as Lemma [9j with some minor adjust¬ 
ments. We first prove the lemma for step functions, then extend by density to all 
of U{I,R N ). 

Let q £ U(I,R N ) be a step function, and let 7 £ F. Choose 0 = to < ti < 
• • • < t n = 1 such that q is constant on each interval {ty,ty + i). Choose 0 = 
So < Si < • • • < s n = 1 so that y(sfc) = ty for each k = 0,1,... ,n. For each 
k , define 7 *, = 7 | [ Sfe _, ,s fe ] - Clearly, for each k , the function is an element of 
the sphere of radius y/ty — tk-i centered at 0 in L 2 [sk-i, Sk]- Another element 
of this sphere is the constant function wy(t) = \J(ty — tk-i)/(sk — s k - 1 ). For 
j = 1,2,3,..., let {wkj} be any sequence of functions along the geodesic arc 
from wy to yji k in this sphere such that limj^oo wyj = s/ly i n L 2 [sy-i, sy\. 
Note that for each j, wyj(s) > 0 for all s £ [sfc_i,Sfc]. Finally, define 7 k,j(s) = 
tk -1 + J (wkj(u)) 2 du for all s £ [sfc_i,Sfc]. Clearly, for all k and j, 7 yj is 
an absolutely continuous, monotone homeomorphism from [sfc_i,Sfc] to [ty-i,ty\. 
For each j , define jj : I —> I by setting 7 ? -(s) = 7 fej(s) for all s £ [sfc_i,Sfc]. 
Clearly, 7 j £ T for all j. Assuming that q(t) = cy on [tk-i,tk], it follows that 

on [s fc _i,Sfc], q * 7 j(s) = \Jl' k} j{ s ) c k = Wk,j{s)c k . By definition of w k ,j, it follows 

that in L 2 [s fc _i, s fc ], lim^^oo q * = linij-^oo w k ,jC k = \fik c k = q* 7- Since this 

limit holds in each subinterval separately, it must hold in L 2 (I). Thus, we have 
produced a sequence 7 ,- in T such that q * 7 , —> q * 7 , which proves the lemma for 
step functions. The extension from step functions to general elements of L 2 (/, R w ) 
is exactly the same as in the proof of Lemma [9j so we omit it. □ 

Next, we want to prove that for all functions w £ U(I,R N ) that are in standard 
form, [w] £ wr. Such a function w can vanish only on a set of measure zero; 
however, each component function of w may vanish on a set of measure greater 
than zero. In order to make our proof run more smoothly, it is helpful to prove 
that given such a w, we can rotate it (using a matrix in 0 (n,K)) to obtain a new 
function in U(I, R w ) with the property that all of its component functions vanish 
only on a set of measure zero. The next few lemmas prove that such a rotation 
exists. 

Lemma 12. Let q : I —> M. N (where n > 1) be an L 2 function with the property that 
< 3 ,_ 1 ( 0 ) has measure 0. Then there exist N pairwise orthogonal (A^— 1)-dimensional 
linear subspaces of R N , which we denote by Hi ,..., Hyy, with the property that 
q~ l (Hy) has measure zero for k = 1,..., N. 
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Proof. First, we note that there exist at most countably many lines l through the 
origin in R w with the property that q _1 (Z) has nonzero measure. To see this, 
note that as l varies over all lines through the origin of R w , the sets q~ 1 (l — {0}) 
are disjoint from each other. But it follows that for all but a countable set of 
these lines, the set q~ x (l — {0}) has measure 0. This is because I cannot contain 
an uncountable collection of pairwise disjoint subsets, all with measure greater 
than 0. (The proof of this is easy: Let C denote a collection of pairwise disjoint 
measurable subsets of I. For each integer n > 0, define the subcollection C n C C 
by C n = {C £ C : n(C) > ^}. Clearly, the cardinality of C n is at most n. The 
subcollection of C consisting of all sets of measure greater than 0 is just the union 
of these sets C n , which is countable.) It follows that for all but a countable set of 
lines l through the origin in R w , /i(g _1 (Z)) = L L {q~ l {l ~ {0})) = 0. Note that this 
last equation used the fact that n(q~ 1 ( 0)) = 0. 

We now construct a sequence 0 C Pi C P 2 C ■ • ■ C Pn- 1 of linear subspaces of 
R w , such that for each k, Pk has dimension k and /r(g _1 (Pfe)) = 0. We construct 
this sequence inductively. In the first paragraph of this proof, we showed that there 
exists a line l through the origin such that q _1 {l) has measure 0. Let Pi be any 
such l. For the inductive step, assume we have already constructed 0 C Pi C 
• • • C Pfc satisfying the conditions, where k < N — 1. Choose an orthonormal basis 
{til, ■ • •, Uk\ of Pfc. Let u and v be any orthogonal pair of unit vectors, both in the 
orthogonal complement of Pk in M. N . For each real number 9 £ [0,7r), let Sg to 
be the linear span of {u±,... ,Uk, (cos 6)u + (sind)t;}. Clearly the subsets of R N in 
the collection {Sg — Pk}ee[o,Tr) are pairwise disjoint, and hence the subsets of I in 
the collection {g -1 (Sg — Pk)}ee[o,tr) are also pairwise disjoint. It follows that for all 
but a countable set of 9 £ [0,7r), fj.{q~ 1 {Sg — Pk)) = 0. But since /i( q~ 1 {Pk )) = 0, 
we know that fj,{q~ 1 {Sg)) = ii(q~ 1 (Sg — Pk)) = 0 for all but a countable set of 9. 
Choosing one of these 6, we then set Pfc + i = Sg, completing the inductive step of 
the construction of our sequence 0 C Pi C P 2 C • • • C P/v-i- 

We now prove Lemma |12| by induction. It is trivially true for N = 1. Assume 
the lemma has been proved for functions / —► R^ -1 , and now suppose we are 
given an L 2 function q : I —> R^ such that g -1 (0) has measure 0. Using the last 
paragraph, construct a sequence 0 C Pi C P 2 C • • • C Pjv-i of linear subspaces 
of R^, such that for each k, Pk has dimension k and n(q~ 1 (Pk)) = 0. We are 
now going to make an adjustment to Pat— i- Let be an orthonormal basis 

of the orthogonal complement of P/v -2 in R W - For each 9 £ [0,7r), let Bg be the 
hyperplane spanned by Pn -2 and the vector (cos0)m+ (sin9)v, and let lg be the 
orthogonal complement of Bg in R w . Since we know that q~ 1 {PN- 2 ) has measure 
zero, we can argue just as in the last paragraph to show that for all but a countable 
set of 9 £ [0, 7r), q~ 1 {Bg) has measure zero. Similarly, for all but a countable set 
of 9 , q~ 1 {lg) has measure 0. Since the union of two countable sets is countable, 
it follows that we can choose a 9 0 £ [0,7r) such that both q~ 1 [Bg 0 ) and q~ 1 (lg 0 ) 
have measure 0. Set Hn = Bg 0 and set l = lg a . Define q : I —>P[n by q = Hoq, 
where II denotes orthogonal projection R N —► H N . Clearly, g -1 (0) = g _1 (Z) has 
measure 0. Also, since projection decreases norms, q is still L 2 . By the induction 
hypothesis, we can find a pairwise orthogonal set of N — 1 subspaces of Hn, which 
we will denote by Hi,... ,Hn~i, each of dimension n — 2, such that g _1 (7Lfe) has 
measure 0 for k = 1,..., TV — 1. Now define Hk = Hk © l for each k = 1,..., N — 1. 
Clearly, Hi,, Hn comprise a set of N pairwise orthogonal (N — l)-dimensional 
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subspaces of R w . Also, for k = 1,..., N — 1, has measure 0. 

Since we already know that ( 7 _ 1 (.ffjv) has measure 0, this completes the proof of 
Lemma [12] □ 


Let us think of the elements of R N as column vectors, so if q £ L 2 (7,R^), we 
can write q[t) = (qi(t), ..., qiy(t)y, where the “prime” denotes the matrix trans¬ 
pose. Then the group of orthogonal matrices 0(N, R) acts on R^, ACo(/, R^), 
L 2 (/,R Ar ) and [/(/, M. N ) from the left in the obvious way. Furthermore, this action 
on L 2 (I,R N ) preserves the L 2 inner product, and hence takes U(I, R w ) to itself. 
It also commutes with the bijection Q : ACo(I,M . N ) —► L 2 (I,'M. N ) defined earlier in 
this paper. In addition, the left actions of 0(N, R) on AC'o(/,R JV ), L 2 (I,R N ) and 
U(I ,R W ) commute with the right actions of T and T on all three of these spaces. 


Lemma 13. Suppose q £ L 2 (I,R N ), where we write q{t) = (qi(t), • • • ,qN{t))', and 
assume that g _1 (0) has measure zero. Then there exists a matrix A £ 0(N,R), 
such that if we define q(t) = (gi(t),..., gjv(t)) / by q(t) = Aq(t), then qf 1 ^ 0) has 
measure 0 for all k = 1,..., N. 


Proof. By Lemma 12 we can choose N pairwise orthogonal (N — l)-dimensional 
linear subspaces of R N , which we denote by Hi,... ,Hn, with the property that 
q ,_ 1 (i7fe) has measure zero for k = 1,... ,7V. For each k, let u k £ R N be a unit 
vector orthogonal to H k . Clearly, {u±,... ,un} forms an orthonormal basis for 
R^. Let {ei,...,ejv} denote the standard basis for R N . Let A £ 0(N, R) be 
the unique matrix satisfying Au k = e k for all k. Define q(t) = (qi{t ),..., g f jv(t)) / 
by q{t) = Aq(t). By definition of A, it is immediate that qfi 1 (0) = <? _ 1 (e^ ) = 


q 1 {ujr) = q 1 (77fc) has measure zero. 


□ 


Lemma 14. Suppose q £ L 2 (I,R) satisfies p(q 1 (0)) = 0. Also, assume that { 77 } 
is a sequence in T, and t £ T. If linq-^oo q * Tk = q * t (with respect to the L 2 
metric), then for all t £ I, limfc-xx, 77 (t) = r(t). 


Proof. First, we observe that it suffices to prove the lemma under the additional 
assumption that |q(t)| = 1 for almost all t £ I. For, suppose we have already 
completed the proof under this extra assumption. By Theorem [l] we know that we 
can write q = w * 7 where \w(t) | is constant for almost all t £ I and 7 £ T. Since 
we are assuming that fJ.(q~ 1 {0 )) = 0, it follows that 7 £ F, since 7(f) = 0 only for 
t in a set of measure 0 . Now, since we are assuming that Hindoo q * Tk = q * t, it 
follows that limfc-^oo (717) = w* (77") in L 2 . By the version of the lemma that we 
are assuming to be proved, it follows that for all t £ I, lim^-s-oo 7(17(f)) = 7(r(f)). 
Since 7 £ T is continuous and bijective, so is 7 -1 , hence we conclude that for all 
t £ /, halloo 77(f) = r(£). 

So we now prove the lemma with the assumption that \q(t)\ = 1 for almost all 
t £ I. Assume that lim^oo q*T k = q*r with respect to the L 2 metric. We proceed 
by contrapositive. Suppose there exists a to for which lim^oo Tk(to) 7 ^ t(£o). (This 
includes the possibility that Hindoo T k (to) does not exist.) Therefore, there exists 
e > 0 such that for all M > 0, there exists a k > M with |r fe (t 0 ) — r(£ 0 )| > £• 
Consider a 77 such that It*,( to) — t(£ 0 )| > e - There are two cases. 

Case 1: Suppose T k (to) < r(to) — e. Compute (q * T k ,q * t) = 

f (q*Tk){t){q*T)(t)dt= f (q*Tk)(t)(q*T)(t)dt+ [ (q* T k )(t)(q* r)(t)dt 
Jo Jo Jt 0 
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< y/ (q*Tk)(t) 2 dt^J ( q*r)(t) 2 dt + ^(g * T k )(t) 2 dt^ ( q*r)(t.) 2 dt 


(by Cauchy Schwarz) 


r T k(t 0 ) / rr(t 0 ) rl / fl 

q{t) 2 dt\ / q(t) 2 dt + J / q(t) 2 dt< / q(t) 2 dt 


'Tfc(io) 


MM 


(integrating by substitution) 

= V T k(to)V T ( t o) + T k (t 0 )\/l - r(t 0 ) 


(using |g(f)| = 1 for almost all t £ I). 

By the calculation at the beginning of the proof of Lemma [lOj this last quantity 
is less than 

V T ( t o) - eV T (to) + \A - (rtf o) - e)\/l - t(£ 0 ) 

which, in turn, is less than 1. Define M e = y/r(to) — e\/ r (^o)+\/l — (r(fo) — e)\/l — r(f 0 ). 
So in Case 1, we have shown that (g * r k ,q* t) < Af e < 1. 

Case 2: Suppose Tfc(f 0 ) > r(£ 0 ) + e. We skip the very similar details; the 
end result is that we produce an m e such that (q* r k ,q *t) < m e < 1. Letting 
9 e = min{^/2 — 2M e , ^2 — 2m e } > 0, we see that d(q*Tk,q*r) > 9 e > 0. It follows 
that q*Tk -/> q*r in the L 2 metric, which proves the contrapositive and completes 
the proof of the lemma. □ 


Theorem 3. Let q : I —► 1^ be any L 2 function such that \q(t)\ = 1 for almost all 
t £ I. Then [q] = gf. 


11 we only need to prove that [g] C gib 
acts on L 2 (J, $L n ) by isometries and this action commutes with 

that for each i = 1,..., TV, q~ : (0) 


13 


Proof. By Lemma 
Since 0(N, 

the action of T, we may assume by Lemma 
has measure 0. Suppose v £ [g]; then there exists a sequence {qfc} in T such 
that lim^oo g * 7 *, = v (with respect to the L 2 metric). It follows that for each 
i = 1 limfc^oo g;* 7 /c = ig. For each i. we can write qt = Wi*<Ji , where eq £ T 

and Wi(t) = ±1 for almost all t, by Theoremjl] Then for each i , lim^oo u>j*(cq 7 /.) = 
Vi . Since wqf — [uq] is a closed set, it follows that there exists 7 * £ f such that 
Vi = Wi * 7 Now, since qi(t) = (wi * eq)(f) vanishes only on a set of measure 0, 
it follows that &i(t) also vanishes on a set of measure 0. Hence, cq £ T. Hence, 
we may write ug = qi * u~ , where a~ Y £ I\ Therefore, Vi = qi * (a~ ji), where 
er” 1 ^ £ f. Letting t* = cr” 1 ^, we have now proven that if v £ [g], then for each 
i, we can write Vi = qi * -q, where each r, £ T. All that is left to prove is that 
T\ — * * * — T n ; it will then follow that v = q * r where r = 17 = • • ■ = qv • However 
Lemma 14 implies immediately that for each t £ J, Tj (£) = lim^oo 7 ^ (f), which 
implies that r = ti = • • • = r„. This completes the proof of the theorem. □ 


Corollary 3. J/g £ L 2 (/, K^), and g x (0) has measure 0, then [g] = gf. 

The proof is the same as that of Corollary [2j so we omit it. 

Corollary 4. If gi,g 2 £ U(I,R N ), then d([gi], fe]) < y/2. 
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Proof. Define two elements 71,72 € T as follows: 


7iW 


0 for 0 < t < .5 
2 f — 1 for .5 < t < 1 


72(f) 


2 f for 0 < t < .5 
1 for .5 < t < 1 


Then an easy computation shows that (91*71,92*72) = 0 , so d(qi *71,92 *72) = a / 2 - 
The corollary follows. □ 


For [/(/, R), it’s interesting to note that this maximum distance is actually 
achieved by [91] and [92], where 91(f) = 1 and 92(f) = — 1 . It is also true that 
these are the only two points in the image of [/(/, K) in S(I, R) that achieve this 
maximum distance! This follows from the material in Sections 7 and 8 for step 
functions, and then for arbitrary functions by the fact that the step functions are 
dense. 


5. Optimal Matching for Step Functions I -a R n 

Given two elements [117] and [w^] of S(I, R N ), it is a basic problem to calculate 
the distance between them and to find a minimal geodesic joining them, if such 
a geodesic exists. The most straightforward way to do this is to find elements 
91 € [iCi] and 92 £ [1V2] such that cf(91,92) = d([iOi], [102] )• In this section, we 
will prove that if [117] and [W2] are elements of S(I, R^), and at least one of them 
is an element of S st (I, R N ), then there exist 91 £ [117] and 92 £ [1C2] such that 
d (91, 92) = d([«7], [^2])- We will also prove that if both [117] and [u^] are elements 
of S st fI,^ N ), then these representatives 91 and q 2 can both be taken to be step 
functions. We begin with a lemma. 

Lemma 15. Let q £ L 2 (I,R N ), and letw : I -A R N be a constant map, w(t) = wo- 
Express I as a disjoint union of two measurable sets, I = A\J B, where A = {f £ 
I : q(t) ■ Wq > 0} and B = {f £ I : q(t) ■ w 0 < 0}. 

Then 


sup (q,w) = \ (q(t ) • w 0 ) 2 dt. 

y JA 

If q(t) -wo = 0 almost everywhere on I, then this supremum is 0, and is realized by 
any 9 £ [9] and w £ [id] . If it is not true that q(t ) • Wo = 0 almost everywhere on 
I, then this supremum is realized by 9 = 9 and w = w * 7, where 7 (f) = J* F(u)du, 
and F(u) is defined by 


F(u) 


0 

(q(u)-Wo ) 2 

f A (q(u)-w 0 ) 2 du 


for U £ B 
for u £ A 


Proof. Since 9T is dense in [9] and tcF is dense in [w] and the group T acts by 
isometries, it follows that 


sup (q,w) = sup (9 * A, w * 7) = sup(9, w * 7) = sup(9, w * 7). 

q&[q],we[w] Aer, 7 er 7gr 
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Let 7 £ r be arbitrary. Then 


(q,w*j) = [ q-(w* 7)= [(q(t)-w 0 )YY (t)dt = [ (q(t)-wo) vV(i)+ [ (q{t)"w 0 )y/y(t) 
Ji Ji Ja Jb 


The integral over B is clearly bounded above by 0 , since q(t) ■ wq < 0 for t £ B. 
The integral over A can be bounded using the Cauchy Schwarz inequality to yield: 



(q(t) ■ w 0 ) 2 dtJ / 7 '(t)dt <\ ( q(t ) • w 0 ) 2 dt, 


where the last step uses the fact that f A Y(t)dt < J T Y(t)dt = 1. A straightforward 
calculation shows that the upper bound is actually achieved by the 7 defined in the 
statement of the lemma. 

□ 


We will need a slightly altered form of Lemma [T5| We state it as a corollary. 

Corollary 5. Suppose H, J, and K are finite closed intervals in R; denote by 
L(J) the length of J. Let q £ L 2 (H,WL n ), and let w : J -A M. N be a constant map, 
w(t) = Wq . Then 


sup (q * A, w * 7) = J / (q{t)- w 0 ) 2 dt^L(J) 
a , 7 y Ja 

where the supremum is taken over all A : K -A H and 7 : K —> J such that A and 
7 are absolutely continuous, onto, and weakly increasing, and A is the subset of 
H on which the function q(t) ■ Wq is non-negative. Furthermore, this supremum is 
actually realized by an appropriate choice of X and 7. 

Proof. We omit the details of this routine proof. The idea is just to transform each 
of the three intervals into I using linear bijections, and then use integration by 
substitution and apply Lemma p~5| 

□ 


Theorem 4. Let [q] £ S(I, R N ) and [ic] £ S st (I,'M. N ). Then there exist q £ [q] and 
w £ [w] such that d(q,w ) = d([q], [in]). 


Proof. Assume our orbit representatives q and w correspond to unit speed parametrized 
curves, so that |q(t)| and |u>(f)| are constant (a.e.) in I. (These representatives 
exist by Theorem |TJ) It follows from this that w is a step function. This is 
because, since [u>] £ S st (I,M. N ), w can only assume a finite sequence of differ¬ 
ent directions. Since its magnitude is constrained to be constant everywhere, it 
follows that w assumes only a finite sequence of different values. Hence, there 
exists a finite sequence of real numbers 0 = So < Si < ■ ■ ■ < Sfc = 1 and 
corresponding finite sequence W\,W2, ■ ■ ■ ,Wk of vectors in such that for all 
j £ {l,...,fc}, w(t) = Wj for all t £ ( Sj-i,Sj ). Define a set T £ R fc+1 by 
T = {(to, ■ ■ ■ ,tk) ■ 0 = to < t± < <2 < • • ■ < tk = 1 }. Clearly T is compact. 
Now define a function M : T —► R by 


M(to, ■ ■ ■ ,tk) — \ f («(*) • w jY d W 

7=1 V 1 


s i ~ s i- 1 - 
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This function is obviously continuous as a function of (to, • • •, £&); since T is com¬ 
pact, M attains a maximum at some element (t 0 , Let M = M(t 0 ,..., tf.) 

We will show first that there exists q £ [q] and ui £ [u>] such that M = (g, w) ; we 
will then show that it is the maximum possible value of all such inner products. 
This will complete the proof of the theorem. 

Fix j £ {l,...,fc}. In order to apply Corollary [HJ let H = [tj-\ ,t,], J = 
Sj], and K = i]. Then Corollary [H] tells us that there exist A j : K -A H 

and 7 j : K —> J (where A j and 7 j are onto and absolutely continuous) such that 


IK 


(( q\H ) * A j)(u) ■ (H J) * 'Y j )(u)du = 



(q(t) ■ Wj^dty/Sj - Sj- 1 , 


and that this integral is the maximum possible over all such A j and 7j. Since for 
each j, = A J _i(^ : ) and 7j(^r—) = 7^-1 (^—), it follows that we can glue 

together the A j ’s to form a single A : / —^ J and can also glue together the 7 j ’s to 
form a single 7 :/—>•/ such that (q * A, w * 7) = M. This shows that M is realized 
as an inner product of a pair of orbit representatives. 

We now show that M gives the maximum value of the inner product, for all orbit 
representatives. Suppose q £ [g] and w £ [w]. By Corollary [3J there exist A £ f and 
7 £ T such that q = q* A and u> = w * 7. Since 7 is onto, for each j we can choose 
Uj £ 7 _1 (sj), an d let tj = It follows that for each j, A([mj_i, uj]) = [tj-\,tj], 

and 7([itj_i, Uj]) = [sj_i, Sj]. By Corollary [5} we may conclude that 

r u o I rij 

/ ((9lfe-i, i iD*A)(u)-((w|[s i _i,s J ])*7)(u)du < J / (q(t) ■ Wj) 2 dty/sj - sj- 1. 


Summing over all j then gives 
r 1 


(q * A )(u) ■ (w * "f)(u)du < M(to ,..., tk) < M. 


/ 0 


Since maximizing the L 2 inner product is the same as minimizing the distance, this 
completes the proof of the theorem. 

□ 


Theorem 5. Ifq,w £ L 2 (I, K w ) are both step functions, then there exist piecewise 
linear functions A, 7 £ T such that 

(g * A, w * 7 ) = sup ( q,u >). 

q£[q\,w£[w\ 

Proof. In the statement of Lemma |15| note that if q is a step function, then the 
function we integrate to get 7 is also a step function. It follows that 7 is piecewise 
linear. In Corollary [5] (still assuming that q is a step function), the reparametriz¬ 
ing functions are obtained from the ones in Lemma [15] by composing with linear 
functions; hence the reparametrizing functions are still piecewise linear. Finally, 
in Theorem [4] the optimal reparametrizing functions are obtained by gluing to¬ 
gether reparametrizing functions of the type formed in Corollary [5j gluing together 
piecewise linear functions results in more piecewise linear functions. 

□ 
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6. Preliminaries on Finding a Precise Optimal Matching For 
Piecewise Linear Functions 

Let /i and fi be two continuous, piecewise linear functions I and let 

gi, <72 £ L 1 2 (I, R n ) be their SRVFs. We will develop an algorithm which will produce 
a pair of optimal representatives for [<71] and [<72]) be., L 2 functions q\ £ [<71] and 
<72 £ [<72] such that d(qi, <72) = d([gi], [92])- Assume that q± and 52 have the property 
that the set on which each of them vanishes has measure 0 . (If this is not true, 
than we can replace them by elements of [gi] and [<72] that have this property, using 
Theorem [I]) According to Theorem | 3 j these optimal representatives will be of the 
form <71 = <71 * 71 and <72 = <72 * 72, where 71,72 £ T. We call such a pair (71,72) an 
optimal matching for /i,/2 (or for qi,qi). 

Since /1 and /2 are piecewise linear, we know that there are subdivisions 0 = 
so < Si < ■ • • < s m = 1 and 0 = to < t\ < ■ ■ ■ < t n = 1 such that /1 is linear on each 
subinterval [si_i,s,;] and fi is linear on each subinterval [tj„%,tj]. As a result, we 
know that <71 is constant on each open interval (sj_i, si) and <72 is constant on each 
(tj-i,tj). In general, q\ and <72 are not defined on the endpoints of these intervals, 
since f\ and / 2 are not differentiable at these endpoints. For each i = 1,..., m, let 
u i = < 7 i((si_i, Si)) and for each j = 1,... ,n, let Vj = qi((tj-i,tj)). We then define 
an n x m matrix W, called the weight matrix , by Wij = Ui ■ Vj. (The dot product 
here is the ordinary inner product in R N .) 

A matching of fi and fi is any pair of reparametrizations 71,72 £ T. Such a 
pair represents a matching in the sense that for each z £ I, the point fi{”fi(z)) 
on the curve parametrized by fi is “matched” to the point fii^iz)) on the curve 
parametrized by fi- Note that because 71 and 72 are only weakly increasing, 
this matching does not give a 1-1 correspondence between the points on these two 
curves. We can assemble 71 and 72 into a single function 7 : I —> I x I defined 
by 7(z) = (71(2),72(2)). This function can be thought of as a parametrized curve 
in I x I that starts at (0,0) and ends at (1,1). Because 71 and 72 are weakly 
increasing, this curve can only move vertically upward, horizontally to the right, 
or in some diagonal direction towards the upper right. We define a vertex of / x I 
to be a point of the form ( Si,tj ), a horizontal gridline to be a line of the form 
t- = tj ( i = 0, 1,... ,n), and a vertical gridline to be a line of the form s = Si 
(j = 0,1,..., m). We define the ij-block, Gij , by Gij = [si_i,Sj] x [tj_i,tj\. 
Because of the weakly increasing nature of 71 and 72, it is clear that 7 -1 (G.y) is 
always a closed subinterval of I. If 7 is linear and non-constant on an interval [a, 6], 
we define the slope of 7 on this interval to be the value of 72/71- On any such 
interval, this slope will be 0, positive, or 00 . 

Given a matching 7, we define a segment of 7 to be the restriction of 7 to some 
closed subinterval of I. We now define two specific types of segment. 

Definition of P-segment : (Note that this is a long definition! It includes all the 
statements up until the definition of an N-segment.) A P-segment is a restriction 
of 7 to an interval [a, b] C /, which has the following properties: 


( 1 ) 7 I [a,6 ] is piecewise linear and injective. 

( 2 ) 7 (a) = (si 0 _i, tj 0 -i) and 7 (b) = (s^,^) are vertices, with io < ii and 
jo < ji, but for all 2 £ (a, b), 7 (z) is not a vertex. Furthermore, Wi 0 j 0 > 0 
and W il j 1 > 0. 
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Figure 1. The grid 


(3) For all blocks Gj such that 7 _1 (Gy) C [a, 6 ], the restriction of 7 to 
7 is linear. We define H l :l to be the slope of the segment as it 
passes through G,j. 

(4) Suppose 7 _1 (G^) = [c, d] C [a, b], where c < d. If Wij < 0, then either 
Hij = 0 or Hi j = 00 . Visually, this says that the parametrized path 7 is 
either vertical or horizontal as it traverses Gj. More precisely, if 7 enters 
such a Gij through the left hand vertical edge, then = 0 , while if 7 
enters such a G (J through the lower horizontal edge, then H,.j = 00 . If 
Wij > 0, then H t ,j is not equal to either 0 or 00 . 

To understand the remaining properties required of a P-segment, note that it 
begins at the vertex (sj 0 _i,t,- 0 _i) and passes through the block Gi 0 j 0 in a linear 
fashion with slope H io j 0 , which is equal neither to 0 nor to 00 by the previous items. 
Up to reparametrization, the remaining portion of the P-segment is completely 
determined by the initial vertex (sj 0 _i, and the initial slope as the segment 

passes through Gi 0 j 0 . To understand this determination we will describe how the 
slope Hi j is required to change as the P-segment passes through a gridline from 
one block to another. First, suppose the segment passes through a vertical gridline 
from Gj j to G l+ i j. There are then three cases to consider: 

Both Wij and Wj + \j are greater than 0. Then the slopes are related as 
follows: 

H i+hj _ f W i+ i,A 2 

Hij \ Wij ) 

Wij-ij < 0. Then H i+ i j = 0. 

Wij < 0 while Wi+ij > 0. By one of the above conditions, we know that 
Hij = 0. To determine Hi + i j, we must find the largest value of k < i for 
which for which Wkj > 0. (This corresponds to the last block G^j that 
the segment passed through with non-zero slope.) Using this value of k. 
Hi + ij must then satisfy 

Hj+1J = (Wi+iA 2 

H k j \ W kJ ) ■ 

Now, suppose the segment passes through a horizontal gridline from G i : j to Gij + \. 
The three cases are completely analogous to the cases of the vertical gridline: 


( 1 ) 


(3) 


( 2 ) 

(3) 


(4) 
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(1) Both Wij and Wij+i are greater than 0. Then the slopes are related as 
follows: 


( 5 ) 


Hij +1 _ ( W id \ 2 

Hi,j 


(2) Wij+i < 0. Then Hij +1 = oo. 

(3) Wij < 0 while W t j + i > 0. By one of the above conditions, we know that 

Hi j = oo. To determine . we must find the largest value of k < j 

for which for which W, ^ > 0. (This corresponds to the last block that 
the segment passed through with non-infinite slope.) Using this value of k, 
Hij + 1 must then satisfy 


( 6 ) 


H iJ+ 1 _ / W itk \ 2 

Hi, k \WiJ+iJ 


This concludes the definition of a P-segment! 



1 - - 

± 

- - - - 

+ 










£ 
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Figure 2 . Left: P-segment. Right: N-segment(in red). 

Definition of N-segment: An N-segment is a restriction of 7 to an interval 
[a, b] C /, which has the following three properties: 

(1) 7(a) = ( Si 0 ,tj a ) and 7(6) = (sjjjtjj) are both vertices, with zo < i\ and 

jo < ji- 

( 2 ) The restriction of 7 to [a, g ^-] is linear and runs horizontally from (s, 0 , tj 0 ) 
to (sj 1 ,tj 0 ), while the restriction of 7 to [^^,6] is also linear and runs 
vertically from (s^,^) to (s^,^). For the special cases in which i 0 = i\ 
or jo = j 1, the entire N-segment is either vertical or horizontal, respectively. 

(3) For 7 I[ a b ] to be an N-segment, there are also the following requirements 
on certain weights: if i £ {*o + 1 ,... ,zi} and j £ {jo, ■ • •, ji + 1}, then 
Wij < 0. Also, if z £ {zo, • • •, i\ + 1} and j £ {jo +1 ..., jj}, then Wij < 0. 

7. Statement and Proof of Main Theorem 

In this section, we state and prove our main result on a canonical form for optimal 
matchings between piecewise linear curves. 

Theorem 6. Let fi and jj be piecewise linear functions I —> Then there 

exists an optimal matching 7 = (71,72) that has the following properties: 

( 1 ) 7 zs a sequence of P-segments and N-segments; i.e., there exists a partition 
{0 = iio < u\ < ■ ■ ■ < ztfc = 1} such that for each i = 1,..., k, 7|r Ui _ 1 Ui i is 
either a P-segment or an N-segment. 
















24 


SAYANI LAHIRI, DANIEL ROBINSON, AND ERIC KLASSEN 


(2) 7 does not contain two consecutive N-segments. 

(3) Suppose that 7 |[ Ulitl2 ] and 7 |[ U3 , U4 ] are both P-segments, and suppose that 
either U 2 = u$ or 7 |[ tt 2 , t i 3 ] is an N-segment. Define (i\,j\) and { 12 ^ 2 ) by 
(sii,An) = 7(^2) and ( s i2 ,t h ) = 7(1x3) . Then the final slope of 7| [ „ li „ 2 ] 
and the initial slope of 7 |[„ 3jll4 ] must be related as follows. Let A = 

B ^x 2 -(-i,j 2 +i > ^ VI):, ,j 2 +i ? and D VFt 2 +i ,.71 * Then Hi 2 j r \j 2 j r \ 

p 2 Hi 1 where 

(4) 

, if C > 0, D > 0 
,ifD< 0,(7 > 0 
, ifD > 0,(7 < 0 
,ifD< 0,(7 < 0 

Note that the prescribed p-interval is empty if CD > AB. In that case, there 
cannot be an optimal matching with one P-segment ending at (s^ftj,) and 
the next P-segment beginning at (si 2 ,tj 2 ) 

Proof. We know by Theorem[ 5 ]that there exists a piecewise linear optimal matching 
between /1 and f 2 . Choose such an optimal matching and call it 7 = (71,72) : I —► 
I x I. We may assume that 7 is injective by replacing it by a constant speed 
reparametrization. Let Vo, V),..., V\,j be an ordered list of all the vertices through 
which 7 passes, starting with Vo = (0,0) and ending with Vm = ( 1 , 1 )- From this 
list, choose an arbitrary vertex V) (with i qL { 0 , M}). If either the portion of 7 
from Vi -1 to Vi, or the portion of 7 from V) to V)+1 passes through a point in 
the interior of some block Gk,i with weight Wk,i > 0, then retain V) in the list. If 
neither of these portions of 7 pass through such a point, then drop V); from the list. 
Continue this elimination process until no more vertices can be dropped. Renumber 
the remaining vertices and revise the number M to reflect the number of vertices 
remaining in the list. The remaining vertices now have the property that for each 
i = 1 ,... ,M — 1 , either the segment of 7 from V)_i to V), or the segment from V) 
to Vj_|_i passes through at least one point in the interior of some block G^j with 
weight Wf c j > 0 . 

For each i = 0,..., M — 1, consider the segment of 7 from V) to V)+ 1 . There are 
two possibilities: 

(1) Type I: If this segment of 7 passes through a point in the interior of some 
block Gk,i with weight Wk,i > 0 , we will prove that it can be replaced by a 
P-segment without affecting the optimality, i.e., without affecting the value 
of fg(q i * 7 i)(u) • (q 2 * 7 2 ){u)du. 

(2) Type II: If this segment of 7 does not pass through such a point, then we 
will prove that it can be replaced by an N-segment, without affecting the 
optimality, i.e., without affecting the value of fg(q 1 * 7 i)(m) ■ (q 2 * 72 )(u)du. 

Lemma 16. Let v,w £ be two vectors, and define two constant functions 
qi : [a, b] —> R w and 92 : [c, d] —> by qi(s) = v and q 2 {t) = w. Let a < (3; 

define 71 : [a,/J] -A [a, b] to be the unique linear function such that 71 (a) = a and 
7 i(/ 3 ) = b and define 72 : [a,/3] —> [ c,d] to be the unique linear function such that 
72 (a) = c and 72 (/ 3 ) = d. Then Xf (91 * 7 i)(u) • (q 2 * 72 )(u)du = (v-w)y/b — a\Jd — c. 


(7) 


/i £ < 


D 2 AB 
AB > C 2 
0 AB] 

y, c 2 J 

D_ 

AB > 

0, OO 


, OO 
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Proof. This is an easy calculation since ( q\ * 71 )(it) = V\J (b — a)/(/3 — a) and (q 2 * 
7 2 )(u) = Wy/(d-c)/{P — a) are constant functions! □ 


Lemma 17. Let v,w £ be two vectors satisfying v ■ w > 0 and define two 
constant functions q\ : [a, 6] — > M. N and q 2 : [c,d\ —> by qi{s) = v and q 2 {t) = w. 
Let 71 : [a, /?] —> [a, b] and 72 : [a, /?] —)• [c, d] be surjective absolutely continuous 
functions with both 7 ^( 14 ) > 0 and "f 2 ( u ) > 0 almost everywhere for u £ [a,/?]. 
Let 71 and 72 be the unique linear bijections [a,/3] —> [a,b\ and [a,/3] —t [c, d], 
respectively. 

Then 

rfi rP 

/ (9i*7i)(w)-(q2*72)(u)du < / (9i*7i)(uM92*72)(w)du = ( v-w)Vb- ay/d- c. 

J Ot J Ot 


Proof. The main tool here is the Cauchy-Schwarz inequality. Note that for u £ 
[a, 1 3], ( q\ * 7 i)(u) = (u)v and (52 * 72 )(u) = i u ) w - We then compute: 



(9i*7i)(w)'(92*72)(w)du 


v-w^i 1 {u)^i 2 {u)du = v-w f \J'y , 1 {u)^j , 2 (u)d'L 


< V 


■ W \J J 7i (u)du)j J y 2 ( u ) du = v ■ W V 7i (P) -7i(a)V , 72(/3) - 72 (a) 


= v ■ w\/b — ay/d — c 


where the inequality is just the Cauchy-Schwarz inequality. Finally, note that if we 
replace each 7 , by 7 ,; for i = 1 , 2 , then since each 7 ' is a positive constant function, 
the Cauchy-Schwarz inequality is actually an equality. □ 


Lemma 18. Suppose 7 passes through a point in the interior of Gk,i for which 
Wk,i > 0. It follows that 7 ~ 1 (Gk,i) = [cq/3], where a < (3. If we replace 7|[a,/31 by 
the unique linear map 7 : [a,/?] — > Gk,i that agrees with 7 at a and j3, then 
rfi rP 

/ (di * 7 i)(w) ’ (d2 * 72)(w)dM > / (91 * 7 i)(«) • (92 * ^ 2 ){u)du. 

J Ot J Ot 

Since we are assuming that 7 is optimal, it follows that this inequality is actually 
an equality, so we can replace 7 | [«,/?] by the linear map 7 without affecting its 
optimality. 


Proof. Let a = 71 (a), b = 7 i(/ 3 ), c = 72 (a), and d = 72 (/3). Since 7 ([a,/ 3 ]) C G k ,i, 
it follows that q\ = v is constant on [o, b] and q 2 = w is constant on [c, d]. Also, since 


Wk,i > 0, we know that v ■ w > 0. Then, from Theorem 17 it follows immediately 
that 


(91 * 71) (u) ■ (92 * 72 ){u)du > (qi* 7i)(u) • (92 * 72) (u)du. 


The rest of the Lemma follows from this. 

There is one other small point to consider here; in our proof, we tacitly assumed 
that qi and q 2 are defined on all of 7 i([a,/ 3 ]) and 72 ([a,/ 3 ]), respectively. However, 
in our case either q\ or q 2 will fail to be defined at points along the boundary of 
the block. As a result, one should consider separately the possibility of a 7 that 
stays along the edge of Gk.i for either an initial portion or a final portion of [a,/3]. 
However it is not possible for such an 7 to achieve a higher value for the integral 
in question. The reason is that the contribution of the integral along the edge of 
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Gk.i will always be zero (since in these regions either or 72 will vanish). And in 
the remainder of the integral corresponding to such a 7, the value of b — a and/or 
the value of d — c will have to be reduced, which will result in a reduction of the 


maximum value of the integral as given in Lemma 16 


□ 


Lemma 19. Suppose 7 passes through a point in the interior of G k y for which 
Wk,i < 0. It follows that r y~ 1 (Gk,i) = [a,/3], where a < ( 3 . If we replace 7|[a,/31 by 
a continuous piecewise linear 7 that agrees with 7 on a and (d but is made up of 
a finite sequence of vertical (upwards) and horizontal (to the right) segments, then 
the resulting 7 will still be optimal. 


Proof. Because 7([a,/3]) C G k ,i, it follows that 



(?i * 7i)(«) ’ (?2 * 72 )(u)du 


(Vk) ■ {wi)yji 1 {u)^ji 2 {u)du < 0 , 


since we are assuming that (v k ) • (wi) = W k ,i < 0. However, note that 


r 0 

/ (91 * 7i)(w) • (<?2 * ii ){ u)du = 0, 

J OL 

since for all u € [a,/?], either 7j(u) = 0 or 72(11) = 0. Since 7 is assumed to be 
optimal, it follows that f 0 (q 1 * 7i)(w) • (<72 * 72 )(u)du = 0, and this contribution 
doesn’t change if we replace 7|[«,/?] by 7. 

□ 


Now, suppose we have an optimal matching 7, and within that 7 we have chosen a 
segment, 7I[ a b ], of Type I. We have proved that we can replace this segment of 7 
with an equally optimal segment that is linear each time it passes through a block 
Gk,i for which W k ,i > 0, and that is a finite sequence of horizontal and vertical 
segments each time it passes through a block G k y for which W k ,i < 0. So assume 
7I [a,b] has these properties. We claim that there is at least one Gk.i, with W k ,i > 0, 
that our segment passes through with positive, non-infinite slope. To prove this 
claim, note that if no such Wy*, exists, then J^{qi * 7i)(w) • (<72 * 72 )(u)du = 0. 
But then, replacing 71 [ a ^by a path that uses a sequence of horizontal and vertical 
segments to get from 7(a) to (sj_i, t k -i), then a diagonal line from (sj_i, ffc-i) 
to ( Sj,tk ), and then a sequence of horizontal and vertical segments to get from 
( Sj,tk ) to 7 (b), would result in a positive integral over this segment, contradicting 
optimality. 

Thus, choose a block G k y, with W^ i > 0, that our segment passes through with 
positive, non-infinite slope. If our entire segment 7|[ a ,&] passes from the lower left 
vertex of this block to the upper right vertex, then 7I is a diagonal line joining 
these vertices, proving it is a P-segment. So, assume 7I [ a> 6] either enters or exits 
Gk,i through a point on an edge that is not a vertex. Just to be specific, assume 
that y I [a, &] exits Gk,i through a point on its right edge, which would be of the form 
( Sk,t *), where ti -1 < t* < ti . Our next task to to examine what happens to 7][ a ,6] 
as it passes through the next block to the right, Gk+i,i- First, consider the case 
in which Wk+1,1 > 0. In that case, by Lemma |18| we know 7 is linear as it passes 
through G'fc +1 ;. The following Lemma tells us the relationship between the slopes 
H k j and H k+ i iZ as 7 passes through these blocks. 
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Lemma 20. (1) Assume that the adjacent blocks G k ,i and G k +i,i both have 

positive weights, and suppose that an optimal 7 passes from G k> i to G k+ \,i 
at the point ( Sk,t*), where ti -1 < t* <ti. Furthermore, assume that 7 has 
positive and non-infinite slope in at least one of these two adjacent blocks. 
Then the slope of 7 in the other block is also positive and non-infinite, and 
these two slopes are related by 

ro , H k+hl fW k+u \ 2 

(8) 0S7 “ VwiT) 

(2) Assume that the adjacent blocks G k ,i and G k ,i+ 1 both have positive weights, 
and suppose that an optimal 7 passes from G k ,i to G k j + 1 at the point 
(s*,ti), where s k -1 < s* < s k . Furthermore, assume that 7 has positive 
and non-infinite slope in at least one of these two adjacent blocks. Then 
the slope of 7 in the other block is also positive and non-infinite, and these 
two slopes are related by 

/ni H kil+ 1 _ f W k>l V 

(9) ^7 " [wk^J 


Proof. We assume that the adjacent blocks G k ,\ and G k+ iy both have positive 
weights, and that 7 passes from G k ,i to G k +i,i at the point ( s k ,t *), where ti- 1 < 
t* < ti. Furthermore, we assume that 7 has positive and nonzero slope in G k ,i- 
First, we will show that 7 must pass through an interior point of G k +i,i • If it 
doesn’t, then it would have to follow a vertical path in the left edge of G k +i,i , 
which is the same as the right edge of G k ,i ; but this would violate the fact that is 
it linear while in G k ^. So choose a and (3 such that 7(a) = (01,71) is an interior 
point of G k ,i and 7 (/ 3 ) = (<72>T2) is an interior point of G k +i,i- It follows that 



{qi*'yi){u)-(q 2 * r y 2 )(u)du 


w k>l \J (s k - cri)(t* - n)+W k+li i vV 2 -Sfc)(T 2 -t*) 


by Lemma []~6j If we view the above formula as a function of a single variable t*, it 
is an easy Calc I problem to show that the value of the integral is maximized when 
we choose t* so that 


(j 2 -1*) = f W k+1 A 2 (f-n) 
(<72 ~S k ) V W k,l J (S k ~<7 1)' 


Since we are assuming that 7 is optimal, it follows that this slope relationship must 
hold. The other cases of the Lemma follow by analogous arguments. □ 


Given a Type I matching, we have shown that it must pass through an interior 
point of a block G k ,i, of positive weight, with a slope that is neither zero nor infinity. 
As we follow this segment in either direction, Lemma [ 20 ] tells us how the slope of 7 
must change, as long as it enters new blocks of positive weight through non-vertex 
edge points. (Of course, if it meets a vertex, that terminates our Type I segment.) 
We now address the question of what happens when a matching passes from a block 
of positive weight (which it traverses a slope that is neither zero nor infinity) to a 
block with non-positive weight. 

Lemma 21. ( 1 ) Suppose an optimal matching 7 passes from a block G k ,i with 

to a block Gk+1,1 at a point ( s k ,t*), where f/_i < t* < t+ Assume that one 
of these blocks has positive weight, and the other has non-positive weight. 
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Also, assume that the slope of 7 in the block with positive weight is non¬ 
zero and non-infinite. Then the slope of 7 in the block with non-positive 
weight is zero; hence, 7 traverses the block with non-positive weight along 
the horizontal line segment t = t*. 

(2) Suppose an optimal matching 7 passes from a block Gk,i to a block Gk,i+i at 
a point ( s*,ti ), where Sk-i < s* < Sk■ Assume that one of these blocks has 
positive weight, and the other has non-positive weight. Also, assume that 
the slope of 7 in the block with positive weight is non-zero and non-infinite. 
Then the slope of 7 in the block with non-positive weight is infinite; hence, 7 
traverses the block with non-positive weight along the vertical line segment 


Proof. Suppose we are in the first case. Also, to be definite, assume that Wk,i > 0 
while Wk+1,1 < 0 and that the slope of 7 in Gk,i is non-zero and non-infinite. We 
proceed by contradiction; suppose that 7 exits Gk+1.1 at a point other than ( Sk,t*). 
In that case, the exit point must be of the form (s, t), where s > Sk and t > t*. 

By Lemma Jl 9 | we know that the portion of 7 passing through Gk+1,1 will con¬ 
tribute 0 to f (q 1 * 7i)(u) • (<72 * 72 ){u)du. Consider what happens if we replace 
the portion of 7 passing through these two blocks by a segment that enters Gk,i 
at the same entry point as 7, passes linearly through Gk,i to the point ( Sk,t ), and 
then proceeds through Gk+1,1 by the horizontal segment from ( Sk,t ) to (s,t). This 


replacement will increase the integral f (q 1 * 7i)(«) • (92 * 72 ){u)du, since it will 
increase the contribution of the portion of 7 in Gk,i (by Lemma 16 1, while not 
changing the contribution of the portion in Gk+1,1, which will still be zero. Thus 
we contradict the optimality of the original 7, and the proof of Case ( 1 ) of the 
Lemma is complete. The proof of Case ( 2 ) is analogous and we omit it. 


□ 


Thus, given a Type I segment, we know it passes through an interior point of a 
block Gk,i , of positive weight, with a slope that is neither zero nor infinity. Following 
this segment in both directions, we know precisely what happens to this segment 
as if it encounters a block of positive weight or a block of negative weight. What 
happens if it encounters several blocks of non-positive weight in a row? 


Lemma 22. (1) Suppose Gk,i and G p ,i are blocks of positive weight, where 

k < p, and suppose that the intervening blocks Gk+1,1, Gk+2,1, ■■■ ,G p -i,i 
all have non-positive weights. If 7 passes through Gk,i with non-zero and 
non-infinite slope, and meets the boundary of Gk,i at the point ( Sk,t *), 
where < t* < fi, then 7 proceeds through all the intervening blocks 
Gk+1,1, Gk+2,1, ■ ■ •, Gp-1,1 with slope 0 (along the horizontal line t = t*), 
and then passes through the block G p ,i with slope related to the slope in Gk,i 
by the formula 


(10) 


Hpl = (w p , A 2 
H k ,i \Wk,J 


If instead of assuming 7 passes through Gk,i with positive, non-infinite slope, 
we assume that it passes through G p ,i with positive, non-infinite slope, then 
we can again conclude that it passes through the intervening blocks with 
slope 0 and passes through Gk,i with positive, non-infinite slope, and that 
these slopes are related by the same equation. 
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( 2 ) Suppose Gk,i and Gk, P are blocks of positive weight, where l < p, and 
suppose that the intervening blocks Gk,i+ 1, Gk,i+ 2, • ■ •, Gk, p -i all have non¬ 
positive weights. If 7 passes through Gk,i with non-zero and non-infinite 
slope, and meets the boundary of Gk,i at the point ( s*,ti), where Sk-i < 
s* < Sk, then 7 proceeds through all the intervening blocks Gk,i+ 1, Gk,i+ 2, • • ■, Gk, P 
with slope 00 (along the vertical line s = s*), and then passes through the 
block G k , p with slope related to the slope in Gk^i by the formula 



( 11 ) 


If instead of assuming 7 passes through Gk,i with positive, non-infinite slope, 
we assume that it passes through Gk, P with positive, non-infinite slope, then 
we can again conclude that it passes through the intervening blocks with 
slope 00 and passes through Gk,i with positive, non-infinite slope, and that 
these slopes are related by the same equation. 

Note that this Lemma contains Lemma[ 20 ]as the special case in which the number 
of intervening blocks (with non-positive slopes) is zero. 

Proof. For definiteness, assume we are in Case ( 1 ) of the lemma. The proof that 7 
continues with slope 0 through all the intervening blocks with non-positive weights 
is the same as the proof of Lemma [ 21 ] if not, we could replace 7 by a matching 
would violate the optimality of the 7. Now that we know that 7 has zero slope 
through the intervening blocks, the proof of the relationship between the slopes in 
Gk,i and G Pt i is identical to the proof of the relationship in Lemma |20| the only 
modification being that we let t* represent that height of the horizontal line instead 
of just the height of the transition point. Case ( 2 ) is completely analogous and we 
omit its proof. □ 

If we are given a Type I segment, we have shown it passes through an interior 
point of a block Gk,i , of positive weight, with a slope that is neither zero nor infinity. 
Following the segment from this block in each direction, we have now proved that 
until it encounters a vertex, it must follow the definition of a P-segment. Of course 
when it encounters a vertex in either direction, that will be the end of the Type I 
segment. Thus, we have proved that each Type I segment is a P-segment. 

We now turn to the proof that each Type II segment can be replaced by an 
N-segment without altering its optimality. We start with an optimal matching 7. 
Assume that our Type II segment is 7[ a ,b] ■ Recall from the definition of a Type II 
segment, that it starts at a vertex, ends at a vertex, and never passes through an 
interior point of a block with positive weight. Also, we know that we cannot have 
two consecutive Type II segments, so if it is preceded by a segment, that segment 
is now known to be a P-segment, and if it is followed by a segment, that segment is 
known to be a P-segment. Let 7 (a) = (s p _i,t 9 _i) and let 7(6) = ( Sk,ti ). Because 
7[o,fc] does not pass through an interior point of any block with positive weight, we 
know that f^(qi * 7 i)(u) ■ (92 * 72 ){u)du < 0 (since the integrand is non-positive 
almost everywhere). However, if we replaced 7[ 0 ,&] by a horizontal segment from 
(sj_i, tj-i) to (sk,tj- 1) followed by a vertical segment from (s p ,t q - 1) to ( Sk,U ), 
then f (qi * 7i)(u) • (92 * 72 )(u)du = 0; by the optimality of 7, it follows that for 
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our Type II segment, J^(q i * 7 i)(u) • (92 * l2){v)du = 0 , and we may make this 
replacement without affecting the optimality. 

Lemma 23 . Ifj is optimal and 7[ a ,f>] is a Type II segment from the vertex (s p _i, t q _ 1) 
to the vertex (sk,ti) then Wjj < 0 for all i,j satisfying p < i < k and q < j < l. 

Proof. Suppose not; choose (i. j) such that p < i < k and q < j < l but Wij > 0 . 
Then, if we replace 7|r a w by a segment that starts at (s p _i, t 9 _i), then proceeds 
by first a horizontal segment and then a vertical segment to then by a 

linear segment from to and then by first a horizontal segment 

and then a vertical segment to (sk,ti), we will increase the value of this integral 
from 0 to a positive number. This contradicts the optimality of 7, and proves the 
Lemma. □ 

To satisfy the definition of N-segment, we need to prove a few more weights are 
< 0. 

Lemma 24 . If'y is optimal and 7[ a ,f>] is a Type II segment from the vertex (s p _i, 1) 

to the vertex (Sfc,t;) then < 0 for all i,j satisfying any one of the following 
conditions: 

• p < i < k and j = q — 1 

• p < i < k and j = l + 1 

• i = p — 1 and q < j < l 

• i = k + 1 and q < j < l 

Note that in some cases one or more of these conditions may be vacuous; for 
example, if q = 0 , then there is no block G l;j satisfying j = q — 1 . 

Proof. The proof is the same for all four conditions, so consider the first one. 
Proceed by contradiction; suppose that > 0 , where p < i < k and j = q — 1 . 
Assume that 7I[ a> b] takes the form of a horizontal segment from (s p _i,tq~i) to 
(sfc, t a - 1), and then a vertical segment from (sfc, t q - 1) to ( Sk,U). (We know that by 
Lemma 23 f^(qi * 7 i)(u) ■ (92 * 72 )(u)du < 0 ; since the segment described makes 
the integral equal to zero, it is an optimal one.) Since q — 1 > 0 (so q > 1 ) in 
this case, we know that our current Type II segment has a segment preceding it, 
and we have proved that this preceding segment is a P-segment. We know that 
G p _i, 9 -i is the last block that this preceding P-segment passed through, and we 
also know that because it was a P-segment, i, 9 -i > 0 , and H p _ i, q -i is positive 
and finite. Let a < a be the lowest parameter value for which 7(a) £ G p -i.q-i- Let 
(s,t) = 7(a). Since the slope of 7 in G p _i, q -i is positive, we know that t < t q _ 1. 
Now, focus attention on the segment of 7 from (s, t) to (si,t q - 1). This segment 
consists of a straight line segment (of positive slope) from(s, t) to (s p _i,t 9 _i), 
followed by a horizontal line segment from (s p _i,t 9 _i) to (sj,t g _i). For any h 
satisfying t <h< t q _ 1, define a segment 7^ consisting of a straight line from (s, £) 
to (s p _i,/i), followed by a horizontal line from (s p -i,h) to (sj_i,/i), followed by a 
straight line from (si_i,/i) to (si,t q - 1). 

By Lemma [l6| the contribution of the segment 7 h to the integral in question is 

C(h) = W p -l^ q -lyJ Sp-l - sVh-t + Wi,q- 1 ^/s i - Si-l^tq-i ~ h. 

Note the contribution of the horizontal segment is zero and, by our assumptions, 
W p _ i t q_i and W it q_i are both greater than zero. Clearly C{h) is continuous for 
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t < h < t q -i, and is differentiable except at the endpoints of this /(.-interval. When 
h = t q _ i, the segment 7^ coincides with the segment of 7 under consideration. 
Clearly, as h —» t q - 1, C'{h) —>• —00, since the derivative of f(x) = yjx approaches 
00 as x —> 0 . This implies that for values of h within some some small interval 
(t q -1 — e,t q - 1], C(h) is a decreasing function of h , and so for h £ (t q -1 — 

C(/i) > G(t q _i). This contradicts the optimality of our original 7, and completes 
the proof of the Lemma. 


□ 


The lemmas we have proved show that a segment of Type I is always a P-segment 
and a segment of Type II is always an N-segment, establishing Statements ( 1 ) and 
( 2 ) of Theorem [6j What remains is to prove Statement ( 3 ) of Theorem [bj which 
gives a relationship between the final slope of a P-segment, and the initial slope of 
the next P-segment (whether or not there is an N-segment between them). 

First consider the case in which one P-segment of our optimal matching 7 ends 
at the vertex (si,tj), and the next one begins at the same point. Since these 
are P-segments, we already know that their slopes H t] in Gij and Hi + 17+1 in 

Gi+ lj+i are both positive. Let p = y / ' g (^ ,J W We need to prove that jj satisfies 
the appropriate inequalities given in Statement ( 3 ) of Theorem [6] Note that these 
inequalities depend on the sign of C and D. (Because we are dealing with P- 
segments, A and B must both be positive, by definition.) This argument proceeds 
by contradiction; we show that if /i is outside the prescribed intervals, then 7 is not 
optimal. 

We begin by assuming that D > 0 . In either of the two cases where this holds, 
the lower end of the prescribed interval for p is D 2 /AB. So, suppose that p < 
D 2 /AB. Choose a point on 7 in the interior of G,; r This point will be of the form 
(si — p : tj — q), where p, q > 0 . Likewise, choose a point on 7 in the interior of 
Gj+ij+i- This point will be of the form (si + u,tj + v), where u,v > 0 . Now, for 
arbitrary x £ [0, u] and y £ [0, q], consider a path 7^ = (7 ®’ v , y%’ v ), composed of 
the following three pieces: first, the line segment from (sj — p, tj — q ) to (sj, tj — y); 
second, the line segment from (sj, tj — y) to (s* +x, tj); third, the line segment from 
(si + x, tj) to (si + u, tj + v). Assume that the portion of 7 from (s* — p , tj — q) 
to (si + u,tj + v) corresponds to the parameter interval z £ [a,/ 3 ]. Parameterize 
7^’iy using this same parameter interval, and assume that it is linear on each of the 
three segments. Define 



By applying Lemma [ 16 ] to the three linear pieces of 7 x ’ y , we obtain 


E(x, y) = Ay/py/q - y + Dsfxy/y + B^/vy/u - x 


where we are in the case of A,B,D > 0 . It is an easy exercise in two-variable 
calculus that the function E(x,y) has a unique absolute maximum on the domain 
(x,y) £ [0,00) x [0,oo), and that this maximum occurs at the point 
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/ D 4 u-B 2 A 2 (*f)\ 

Vo = q 1 D 4 u + D 2 A 2 p J 

We now observe that this maximum (xo,yo) lies in ( 0 ,u) x ( 0 , q), as follows. First, 
note that every individual variable occurring in the expressions for xq and yo has 

a positive value. Furthermore, recall that y = \J Hi jj’ 3 ^ Since we 

are assuming that y < D 2 /AB , it follows immediately that the numerators in 
the formulae for both Xq and yo are positive and therefore Xo,yo > 0 - Since the 
numerator in the fraction for Xq is less than D 4 q, while the denominator is greater 
than D 4 q, it follows that xq < u and, similarly, that yo < q. Hence we have shown 
that ( Xo,yo ) lies in ( 0 , u) x ( 0 , q). Since E has an absolute maximum at (xo, yo), 
it follows that E{x 0 ,y 0 ) > E{ 0 , 0 ). But this contradicts the optimality of 7, since 
7 0 ’ 0 corresponds exactly to our original 7! 

Similarly, under the assumption that C > 0 , we show that y > ABIC 2 leads to 
a contradiction. This proves Statement ( 3 ) for two adjacent P-segments. 

The case of two P-segments separated by a single N-segment is similar. Suppose 
one P-segment ends at a vertex (sj, tj) and the next one starts at (st, fj), and there 
is an N-segment from ( Si,tj) to ( Sk,ti )• Once, again, we will assume we are in the 
case D > 0 , and suppose that y < D 2 /AB. Choose a point on 7 in the interior of 
Gij. This point will be of the form (s^ —p, tj — q ), where p, q > 0 . Likewise, choose 
a point on 7 in the interior of Gk+i.i+i- This point will be of the form (Sk +u, ti+v), 
where u, v > 0 . Note that the portion of 7 from (s* — p , tj — q) to (sj, + u,ti + v) 
consists of four line segments: first from (s* —p, tj — q) to (s,;, tj) , second from (sj, tj ) 
to (. Sk,tj ), third from ( Sk,tj) to (sk,ti), and fourth from ( Sk,ti ) to (.Sk + u,ti +v). 
Now, for arbitrary x £ [ 0 ,u] and y £ [ 0 , q], consider a path 7^ = ( 7 i’ y , 72 ,y )> 
composed of the following five line segments: first from (s, —p, tj — q) to (s,;, tj — y), 
second from ( Si,tj — y) to ( Sk,tj ~ y)i third from ( Sk,tj ~ y) to (sfc +x,tj), fourth 
from (sfc + x, tj) to (s*, + x, ti), and fifth from (s*, + x, ti) to (s^ + u,ti + v ). The 
rest of the argument proceeds just as before; the contribution of the integral over 
the segment "f x,v is again given by the formula 

E(x, y) = A^JpyJq - y + Ds/x^/y + B^vy/u - x 

since the horizontal and vertical segments have no contributions. By finding that 
the maximum value of E(x,y) does not occur at (x,y) = ( 0 , 0 ), we contradict the 
assumption that 7 was optimal. 

This completes the proof of Theorem [6] 

□ 


8 . Algorithm for Producing a Precise Optimal Matching of PL 

Curves 

In Theorem [ 6 j we proved that given PL curves f± and fa, there exists an optimal 
matching 7 = (71,72) that is a union of P-segments and N-segments. We now 
outline our algorithm for producing such an optimal matching. Throughout this 
section, we continue using the notation developed in the previous section for our 
curves /1 and fa and their SRVF’s qi and q 2 . We assume that the q[s are step 
functions that do not take the value zero on any of their subintervals. 
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The algorithm examines each vertex (s,, t :j ). one row at at time, in the order 

(s 0 ,t 0 ),(s±,to),(s2,to), ■ ■ •,(s 0 ,*i),(si,ti),(S2 ,ti ),..., (s m _i, t n ), (s m ,t n ) 

When it arrives at a vertex (si,tj), it checks whether an optimal segment has been 
found from (so,£o) to (,s*,tj). If no such optimal segment has been found, it skips 
to the next vertex. 

However, if such an optimal segment has been found, it implements a “search¬ 
light” procedure, looking for segments starting from (s % , tj ), as follows: 

• If Wi + ij_|_i < 0 , the algorithm finds all possible N-segments beginning at 
( Si,tj ). Suppose such an N-segment ends at ( Sk,ti). The algorithm checks 
whether the value of the optimal segment from (so,to) to ( Si,tj ) is higher 
than the value of the best segment found so far from (s 0 , to) to ( Sk,ti ). If it 
is, then the union of these two segments yields a new best possible segment 
from (so,fo) to (sfe,t/), and this segment is recorded as such. If it is not, 
then this N-segment is simply ignored. 

• If Wi+i,j+i > 0 , then the algorithm examines P-segments beginning at 
( Si,tj ). It does not have to examine all such P-segments, because of the 
slope restriction imposed by the last clause of Theorem [ 6 ] To be more 
precise, by considering the final slope of the last P-segment occurring in 
the optimal path from (s 0 ,£o) to () and the value of four relevant 
weights, the last clause of Theorem [G] specifies an allowable range of slopes 
for the next P-segment. Our searchlight procedure examines all P-segments 
beginning at (s,;, tj) whose initial slopes are within this range. (We will soon 
give some more details on how we accomplish the enumeration of these 
P-segments.) Suppose such a P-segment ends at (s*,, t/)- The algorithm 
checks whether the sum of the values of this new P-segment and the optimal 
segment from (so, to) to (s*, tj) is greater than the value of the best segment 
found so far from (so,£o) to ( Sk,ti). If it is greater, the union of these two 
segments yields a new candidate for best possible segment from (so,to) to 
( Sk,ti ), and this segment is recorded as such. If it is not, then this new 
P-segment is ignored. 

During the application of this algorithm, by the time we are examining a vertex 
(si,tj), we have already determined whether or not there exists a segment from 
(so,to) to (si,tj) that follows the rules of Theorem [ 6 ] 

Thus when we arrive at the final vertex (s m ,t n J 7 we will have determined the 
best possible segment from (so,to) to ( s m ,t n ). 

We now make some further comments on the searchlight procedure alluded to 
above. In the first case, we are searching for all possible N-segments starting at 
( Si,tj ). This can be accomplished by a relatively simple combinatorial procedure, 
searching for vertices above and to the right of ( Si,tj ) which will be the endpoint 
of an allowable N-segment. 

However, the searchlight procedure has more subtlety in the second case, where 
we are searching for all possible P-segments, with starting slope within a given 
interval, say \hi,ho\. Because of this subtlety, we give some more details about how 
this is accomplished. In order to make sure we don’t miss any allowable P-segments 
due to round-off error, we begin by choosing an initial slope hi — e, where e denotes 
some convenient small positive number. Then, we construct a segment beginning 
at the vertex (si,tj) and following the slope-change rules from the definition of 
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P-segment whenever we cross from one block to the next. There is essentially a 
zero probability that this segment will meet a vertex, so the segment ends when it 
arrives at either the vertical line s = 1 or the horizontal line t = 1 . Technically, 
this segment is not a P-segment, because its final point is not a vertex. The idea 
of the searchlight algorithm is that we want to find the next initial slope above 
(hi — e) that will result in a P-segment that actually terminates at a vertex. There 
is a nice trick for accomplishing this. Note that the slope of this segment changes 
each time it passes from one block to the next, because of the change in the weights 
as we pass from one block to the next. However, it is very easy to perform a PL 
reparameterization of the original curves, that will result in all the blocks that this 
path passes through having the same weight! For example, consider the case in 
which the first edge-crossing of our segment takes it from G i+ ij+i to Gi + 2,j+ 1- By 
choosing a linear reparameterization 7 : [sj + i, Sj+2] —i► [sj + i, s, + 2], we can change 
the value of the q-function of fi on this portion of the curve to any multiple of its 
original value iq+2 that we desire. Therefore, we can change the weight W l+ 2,j+i to 
make it equal to the weight Wj+i,j+i by such a reparameterization. (Of course, we 
must translate the values of Sk for all k > i + 2 in order to accommodate the new 
value of Si+2-)- Since the weights of these two blocks are now equal, it follows from 
the slope transition formula that the slope of the segment will now remain the same 
as our segment basses from G,;+ i,j+i to G,; + 2,j+i ■ We proceed along our segment, 
making a similar reparametrization of either fi or f-2 each time the segment passes 
from one block to the next. The result of this procedure will be that our entire 
segment has the same slope (equal to its initial slope in G i+ lj+i)- Note that the 
total parameter intervals will no longer be the unit intervals that they were to start 
with, but that doesn’t matter. Also, note that the coordinates of several of the 
vertices will have been changed by these reparameterizations. 

We need to find the lowest slope above hi — e for which the segment encounters 
a vertex. But, because the slopes are all the same along the segment, this becomes 
easy. Let S denote the set of vertices that are either the upper end of a vertical 
edge crossed by our segment, or the left end of a horizontal edge crossed by our 
segment. For each of the vertices ( Sk,ti ) in S, compute the ratio ti/sk', the lowest 
value of this ratio will obviously be the lowest initial slope above hi — e for which 
our segment encounters a vertex. Call this new slope hi. Going back to our original 
parameterizations, we have our first P-segment, starting at ( Si,tj ), with initial slope 
hi. 

To find the next P-segment, we begin by constructing a segment starting at 
( Si,tj ), with slope hi + e for a very small e, that follows the slope-change rules 
whenever it passes from one block to another. There is a zero probability that this 
segment will encounter a vertex, so it will end when it arrives at either the vertical 
line s = 1 or the horizontal line t = 1 . To find the next slope above hi + e that will 
yield a P-segment, we use exactly the same path straightening procedure that we 
just described. We proceed in this manner until we arrive at a slope above ft-2- This 
gives us all the P-segments starting at (si,tj) with initial slopes in the required 
range. Note that for each P-segment we construct, we just need to construct one 
“test” segment to find it. 
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9. Examples 

In the following pages, we present the results produced by implementing the 
aforementioned algorithm on different pairs of ID, 2 D and 3 D-curves. In case 
of lD-curves, A shows the original curves as graphs, B shows the aligned curves 
and C shows the optimal matching on / x I grid. In case of 2 D and 3 -D-curves, 
the alignment of the curves is shown in figure A, the geodesic is shown in B and 
C represents the optimal matching on the / x / grid for the pair of curves. The 
following table shows the list of the pairs of curves. 


EX 


1(1D) 

2(1-D) 

3(2 D) 
4(2 D) 
5(2 D) 
6(2 D) 
7(2 D) 
8(3 D) 
9(3 D) 


DESCRIPTION OF THE PAIRS OF CURVES 

r i 5 

/ \(t) = fit), /2(f) = git), t £ \f }„-o’ taken f rom a random data set 
flit) = f{t), f 2 {t) = git), t £ {xm} 1 =o i taken from a simulated data set 
flit) = it, fit)), f 2 (t) = it,git)), t £ { } ^,—0, taken from the female growth data set [XU] 

flit) = ( 2 ?rt, 2t rt) and f 2 {t) = (2?r t, sin(67rt)), t £ {^}^ =0 
flit) = (1 + cos(2tt(1 - t)), sin(27r(l — t))), f 2 (t) = (-1 + cos(-2tR), sin(27rt)) , t £ {||}^ 5 =0 
flit) = (1 + cos(2tt( 1 -t)),sin( 27 r(l -t))), f 2 {t) = (-1 + cos(-2tR), sin(27rt)) , t£ {f}^ =0 
flit) = (2?rt, sin(67rt)), f 2 {t) = (2t rt, sin( 47 rf)) , t £ {i|}^ 5 =0 
flit) = (cos 47 rt, sin 47 rt, t), f 2 (t) = (cos87t t, sin 87 rf, t) , t £ {gg }^ 0 
fi(t) = (Ant cos( 47 rt), Antsm( 4 nt), ( 47 r£) 2 ), f2(t) = ( 47 r£cos( 47 r£), — int sin( 47 r£), ( 47 r£) 2 ) , t G {^} 


50 

n—O 
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Figure 3 . Example 1 ( 1 D). Distance before alignment is 1 . 4815 . 
Distance after alignment is 0 . 5071 . 





Figure 4 . Example 2 ( 1 D). Distance before alignment is 1 . 4312 . 
Distance after alignment is 0 . 1195 . 
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A 


B 



Figure 5. Example 3(2 D). Distance before alignment is 7.0108. 
Distance after alignment is 4.0721. 





c 


Figure 6. Example 4(2D). Distance before alignment is 3.9107. 
Distance after alignment is 2.8418. 
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Figure 7. Example 5(2 D). Distance before alignment is 2.5064. 
Distance after alignment is 2.0683. 



Figure 8. Example 6(2 D). Distance before alignment is 2.4495. 
Distance after alignment is 2. 
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A B 



Figure 9. Example 7(2 D). Distance before alignment is 4.1655. 
Distance after alignment is 1.7899. 



Figure 10. Example 8(3 D). Distance before alignment is 6.1114. 
Distance after alignment is 3.2117. 
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Figure 11. Example 9(31?) . Distance before alignment is 8.5302. 
Distance after alignment is 8.5253. 




Figure 12. Comparing results from the Dynamic programming 
(top row) with the algorithm (bottom row). Distance before align¬ 
ment is 1.57. Distance after alignment : 1.5239 (using DP) ; 1.2457 
(using our algorithm). 
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